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Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 15, 2004, 15:11:57 ; Search time 3229.12 Seconds 

(without alignments) 
10265.033 Million cell updates/sec 

US-10-070-532-3 
1110 

1 atggagccctcagccacccc ttccctggagtctgctctaa 1110 

I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 27513289 seqs, 14931090276 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



55026578 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



EST: * 
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em_estba: * 
em_esthum: * 
em_estin: * 
em_e s tmu : * 
em_estov: * 
em_estpl : * 
em_estro: * 
em_htc: * 
gb^estl : * 
gb_est2:* 
gb_htc : * 
gb_est3 : * 
gb_est4 : * 
gb_est5 : * 
em_estfun: * 
em_estom: 
em_gss_hum: * 
em_gss_inv: * 
em_gss_pln: * 
em_gss_vrt : * 
em_gss_f un : * 
em_gss_mam: * 
em_gss_mus : * 
em_gss_pro: * 
em_gss_rod: * 
em_gss_phg : * 
em_gss_vrl: * 



28: gb_gssl:* 
29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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DB 


ID 
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1 


729.2 


65. 


7 


1740 


11 


BC035686 


BC035686 


Homo sapi 


c 


* 2 


696 


62. 


7 


886 


13 


BX433093 


BX433093 


BX433093 


c 


3 


650.4 


58. 


6 


899 


13 


BX433092 


BX433092 


BX433092 




4 


554.4 


49. 


9 


753 


29 


AY420885 


AY420885 


Homo sapi 




5 


521.4 


47. 


0 


3470 


11 


AK048781 


AK048781 


Mus muscu 




6 


521.4 


47 . 


0 


3729 


11 


AK038551 


AK038551 


Mus muscu 




7 


511.8 


46. 


1 


1790 


11 


BC035858 


BC035858 


Homo sapi 




8 


510.4 


46. 


0 


3153 


11 


AK079572 


AK079572 


Mus muscu 


c 


9 


500.6 


45. 


1 


790 


14 


CF147830 


CF147830 


AGENCOURT 




10 


467.2 


42. 


1 


750 


29 


AY420886 


AY420886 


Pan trogl 




11 


462.4 


41. 


7 


726 


29 


AY420887 


AY420887 


Mus muscu 




12 


462.4 


41. 


7 


1001 


9 


AL535838 


AL535838 AL535838 




13 


385.2 


34. 


7 


993 


12 


BM926746 


BM926746 


AGENCOURT 




14 


376.2 


33. 


9 


543 


13 


BX119589 


BX119589 


BX119589 




15 


359 


32 . 


3 


788 


14 


CF147829 


CF147829 


AGENCOURT 




16 


322.8 


29. 


1 


382 


12 


BQ042116 


BQ042116 


sheepl Sh 


c 


17 


288 


25. 


9 


525 


12 


BI133700 


BI133700 


UI-M-BH3- 




18 


280.6 


25. 


3 


635 


12 


BM939496 


BM939496 


UI-M-BH3- 




19 


277.8 


25. 


0 


520 


13 


BQ269289 


BQ269289 


ik23fl2.y 




20 


264.2 


23. 


8 


627 


10 


BB632359 


BB632359 


BB632359 




21 


263.8 


23 . 


8 


599 


12 


BM933820 


BM933820 


UI-M-BH3- 




22 


261.6 


23. 


6 


611 


13 


BY723922 


BY723922 


BY723922 




23 


248.4 


22. 


4 


892 


13 


BX409735 


BX409735 


BX409735 




24 


208. 8 


18. 


8 


477 


12 


BM087401 


BM087401 


500158 MA 




25 


200.8 


18. 


1 


662 


10 


BB632883 


BB632883 


BB632883 




26 


199.8 


18. 


0 


1073 


12 


BM920548 


BM920548 


AGENCOURT 


c 


27 


197.8 


17. 


8 


625 


13 


BQ285933 


BQ285933 


ik23f 12.x 




28 


197.6 


17. 


8 


505 


10 


BB651179 


BB651179 


BB651179 




29 


195.2 


17. 


6 


245 


12 


BI976482 


BI976482 


485407 MA 




30 


186.2 


16. 


8 


464 


13 


BY239887 


BY239887 


BY239887 




31 


177 


15. 


9 


444 


14 


R55704 


R55704 yg88hl0.rl 


c 


32 


164.2 


14. 


8 


703 


29 


CE375359 


CE375359 


tigr-gss- 




33 


162 


14. 


6 


721 


29 


CE235359 


CE235359 


tigr-gss- 




34 


161.6 


.14 : . 


6 


1290 


29 


AY411591 


AY41159L 


liomo sapi 




35 


156.2 


14. 


1 


1296 


29 


AY411593 


AY411593 


Mus muscu 


c 


36 


155.6 


14. 


0 


1013 


9 


AL535837 


AL535837 AL535837 




37 


146.2 


13. 


2 


768 


13 


BX109847 


BX109847 


BX109847 




38 


133 


12. 


0 


257 


10 


AW427900 


AW427900 


64510 MAR 




39 


120.2 


10.8 


526 


29 


CG978334 


CG978334 


CH240 169 


c 


40 


119.2 


10. 


7 


1005 


28 


CC212654 


CC212654 


CH261-75F 


c 


41 


119.2 


10. 


7 


1058 


28 


CC297061 


CC297061 


CH261-177 




42 


117.4 


10. 


6 


1113 


29 


AY420480 


AY420480 


Homo sapi 




43 


115.4 


10. 


4 


1100 


29 


AY420481 


AY420481 


Pan trogl 


c 


44 


114.4 


10. 


3 


1194 


28 


CC279941 


CC279941 


CH261-24C 




45 


108.8 


9. 


8 


1113 


29 


AY420482 


AY420482 


Mus muscu 



ALIGNMENTS 



RESULT 1 
BC035686 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC035686 1740 bp mRNA linear HTC 20-SEP-2002 

Homo sapiens, Similar to hypocretin (orexin) receptor 1, clone 
IMAGE: 5750551, mRNA. 
BC035686 

BC035686.1 GI:23242909 
HTC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1740) 
Strausberg, R. 
Direct Submission 

Submitted ( 31- JUL-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact: MGC help desk 

Email : cgapbs-r@mail . nih . gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC) , 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact: nisc_mgc@nhgri.nih.gov 

Akhter,N. , Ayele,K. , Beckstrom-Sternberg, S .M. , Benjamin, B. , 
Blakesley,R.W. , Bouf f ard, G. G. , Breen,K., Brinkley,C, Brooks, S., 
Dietrich, N.L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., 
Maduro, Q. L. , Masiello, C. , Maskeri, B. , Mastrian, S . D. ,McCloskey, J. C . , 
McDowell, J., Pearson, R. , Stantripop, S . , Thomas, P. J., Touchman, J. W. , 
Tsurgeon,C, Vogt,J.L., Walker, M. A., Wetherby, K. D. , Wiggins, L., 
Young, A., Zhang, L.-H. and Green, E.D. 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: -http://image.llnl.gov 
Series: IRAK Plate: 79 Row: m Column: 17 

This clone was selected for full length sequencing because it, 
passed the following selection criteria: matched mRNA gi : 4557636 
This clone has the following problem: frame shifted. 
FEATURES Location/Qualifiers 
source 1. .1740 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="IMAGE: 5750551" 

/tissue_type="Lung, Spleen, fetal, pooled" 



/clone_lib="NIH_MGC_122 ,f 
/lab_host="DH10B" 
/note="Vector: pCMV-SP0RT6" 

ORIGIN 

Query Match 65.7%; Score 729.2; DB 11; Length 1740; 

Best Local Similarity 83.2%; Pred. No. 4e-138; 

Matches 923; Conservative 0; Mismatches 8; Indels 179; Gaps 1; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 506 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 565 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 566 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCG'CGATTATCTG 625 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 626 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 685 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 
I I I I I I I I I I I I I I I I I I 

Db 686 CTGGTGGGCAACACGCTG 703 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Db 704 703 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Db 704 703 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 704 — GGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 746 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 747 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 806 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 807 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 866 

Qy 541 GCAGTCATGCAATCCAGCAGT GT GCTGCC f GAGCTAGCCAACCGCACACGGCTCTT CTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II II I I I I 
Db 867 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 926 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 927 GT CTGT GAT GAAC GCT GGGC AGAT GACC TCT AT C CCAAGAT CT AC C ACAGT TGCTTCTTT 986 



Qy 

Db 



661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I 
987 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 104 6 



Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1047 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 1106 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1107 C C CT CAGAC CAGCT G G GGGAC CT GGAGCAG GGC CT GAGT GGAGAGC CC CAGC CCCGGGCC 1166 



Qy 841 CGCGCCTTCCT GGCT GAAGT GAAGC AGAT GC GT GC ACGGAGGAAGACAG C CAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1167 CGCGCCTT CCTGGCT GAAGT GAAGC AGAT GCGT GCACGGAGGAAGACAGCCAAGATGCT G 1226 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTC7VATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I 
Db 1227 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1286 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1287 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1346 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1347 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1406 

Qy 1081 CTCAGTGGCCXTCCCTGGAGTCTGCTCTAA 1110 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1407 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1436 



RESULT 2 

BX433093/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BX433093 886 bp mRNA linear EST 15-MAY-2003 

BX433093 Homo sapiens FETAL BRAIN Homo sapiens cDNA clone 
CS0DF013YE04 3-PRIME, mRNA sequence. 
BX433093 

BX433093.1 GI:30779168 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 886) 

Li,W.B., Gruber,C, Jessee, J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished (2001) 
Contact: Genoscop'e" 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref @genoscope . ens . f r , Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 151. r For 
more information about this cluster, see 
http : / /www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0BAI011ZB01_CS00962_2&cluster=151- r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSOBAI011ZB01_CS00962 2. 



FEATURES Location/Qualifiers 
source 1. .886 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DF013YE04" 
/tissue_type="FETAL BRAIN" 
/dev_s tage=" fetal " 

/clone_lib="Homo sapiens FETAL BRAIN" 

/note="Organ: brain; Vector: pCMVSPORT_6; 1st strand cDNA 
was primed with a Notl-oligo (dT) primer. Five prime end 
enriched, double-strand cDNA was digested with Not I and 
cloned into the Not I and EcoRV sites of the pCMVSPORT 6 
vector. Library was not normalized." 

ORIGIN 

Query Match 62.7%; Score 696; DB 13; Length 886; 

Best Local Similarity 98.5%; Pred. No. 1.7e-131; 

Matches 702; Conservative 0; Mismatches 11; Indels 0; Gaps 0; 



Qy 377 AGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGT 436 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 745 AGGCTGTGTCCGTGTCAGTGACAGTGCTAACTCTCAGCTTCATCGCCCTGGACCGCTGGT 686 

Qy 437 ATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCC 496 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 685 ATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATNC 626 

Qy 497 TGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCA 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 
Db 625 TGGGCATCTGGCCTGTGTCGCTGGCCATCATGGTGCCCAGGGCTGCAGTCATGCAATGCA 566 

Qy 557 GCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCT 616 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 565 GCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCAGTCTGTGATGAACGCT 506 

Qy 617 GGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGG 67 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 505 GGGCAGATGACCTCTATCCCT^AGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGG 446 

Qy 677 CCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCC 736 

I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 445 CCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCC 386 

Qy 737 AGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGG 796 

I IN I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I nMMM I 

Db 385 AGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGG 326 

Qy 797 GGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTG 856 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 325 GGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGCCCGCGCCTTCCTGGCTG 266 

Qy 857 AAGT GAAGCAGAT GCGT GC AC GGAGGAAGACAGC CAAGAT GCT GAT GGT G GTGCT GCT GG 916 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 265 AAGT GAAGCAGAT GCGT GC AC GGAGGAAGACAGC CAAGAT GCT GAT GGT G GTGCT GCT GG 206 



Qy 



917 



TCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGA 976 







1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I 1 1 1 1 1 1 1 




L)D 


OAR 


TCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGA 


146 


Qy 


977 


TGTTCCGCCAAGCCAGTGACCGCG7\AGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGC 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 


1036 


Db 


145 


TGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGC 


86 


Qy 


1037 


TGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTCAGTGGC 1089 
1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 




Db 


85 


TGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTCAGTGGC 33 





RESULT 3 

BX433092/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
' AUTHORS 
TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BX433092 899 bp mRNA linear EST 15-MAY-2003 

BX433092 Homo sapiens FETAL BRAIN Homo sapiens cDNA clone 
CS0DF013YE04 3-PRIME, mRNA sequence. 
BX433092 

BX433092.1 GI: 30779167 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 899) 

Li,W.B., Gruber,C, Jessee, J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished (2001) 
Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref ©genoscope . ens . fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies , a division of 
Invitrogen. This sequence belongs to sequence cluster 151. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/ cluster. cgi?seq=CS0BAI011ZB01_CS00962_l&cluster=151. r. 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSOBAI011ZB01_CS00962_1 . 

Location/Qualifiers 

1. .899 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DF013YE04" 
/tissue_type="FETAL BRAIN" 
/ dev__s tage=" fetal " 

/clone_lib="Homo sapiens FETAL BRAIN" 

/note="0rgan: brain; Vector: pCMVSP0RT__6 ; 1st strand cDNA 
was primed with a Notl-oligo (dT) primer. Five prime end 
enriched, double-strand cDNA was digested with Not I and 
cloned into the Not I and EcoRV sites of the pCMVSPORT 6 
vector. Library was not normalized." 



ORIGIN 



Query Match 



58.6%; Score 650.4; DB 13; Length 899; 



Best Local Similarity 94.0%; Pred. No. 3.5e-122; 

Matches 686; Conservative 0; Mismatches 43; Indels 1; Gaps ■ 1; 



Qy 


372 


TCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCG 


431 






Ml 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 




Db 


738 


TGTAAGNNCTGTGTCGTGTTCAGTGGCAGTGCTACTTCTCAGCTTCATCGCCTGGACCCG 


679 


Qy 


432 


CTGGTATGCCATC-TGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCT 


490 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


678 


CTGGTATGCCATCATCCCACCCACTATTGTCAAAGAGCACAGCCCGGCGGGCCCGTGCTC 


619 


Qy 


491 


CCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGC 550 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I | | | | | | | | | | | | | | | 




Db 


618 


CCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGG 


559 


Qy 


551 


AATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATG 


610 






Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I Mill IN 




Db 


558 


AATGCAGCAGT GT GCT GC CT GAGCTAGCCAACCGCACACGGCT CTTCTCAGTCT GTGATG 


499 


Qy 


611 


AAC GCT G G GC AGAT GAC CT CT AT CC CAAGAT CT AC C ACAGT TGCT TCT T TAT T GT CAC CT 


670 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


498 


AACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCT 


439 


Qy 


671 


ACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGG 


730 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


438 


ACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGG 


379 


Qy 


731 


GCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACC 


790 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


378 


GCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACC 


319 


Qy 


791 


AGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCC 


850 






1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


318 


AGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGCCCGCGCCTTCC 


259 


Qy 


851 


T GGC T GAAGT GAAGC AGATGC GT GCAC GGAGGAAGACAGC CAAGAT GCT GAT GGT GGT GC 


910 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i ■ 1 i i i i i I i i i i i i i \ i i i i i i i i i i i i i i i t i i i i \ \ \ \ i i i i i [ i 1 i i i i i i j i j i 




Db 


258 


T GGCT GAAGT GAAGC AGAT GC GT GCAC G GAG GAAGACAGC CAAGAT GCT GATGGT GGT GC 


199 


Qy 


911 


TGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGT 


970 






M 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 




Db 


198 


TGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGT 


139 


Qy 


971 


TCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCC 


1030 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | 1 1 1 1 1 1 1 




Db 


138 


TCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCC 


79 


Qy 


1031 


ACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTCAGTGGCC 


1090 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Ml 




Db 


78 


ACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTCAACTTCCTCAGGTGCAA 


19 


Qy 


1091 


TTCCCTGGAG 1100 




Db 


18 


1 II 1 1 1 1 

ATTCCGGGAG 9 





RESULT 4 



AY420885 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



753 bp DNA linear GSS 17-DEC-2003 

VIRTUAL TRANSCRIPT, partial sequence, 



gene 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



AY420885 

Homo sapiens HCRTR1 gene, 
genomic survey sequence. 
AY420885 

AY420885. 1 GI:39776842 
GSS. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 

1 (bases 1 to 753) 

Clark, A. G., Glanowski , S . , Nielson,R., Thomas, P., Ke jariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D. R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T . J. , Sninsky, J. J. , 
Adams,M.D. and Cargill,M. 

Inferring nonneutral evolution from human-chimp-mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 753) 

Clark,A.G., Glanowski, S . , Nielson,R., Thomas, P., Ke j ariwal, A. , 
Todd, M. A., Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence as made by sequencing genomic exons and ordering them 
based on alignment. 

Location/Qualifiers 

1. .753 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
<1. .>753 
/gene="HCRTRl" 
/locus_tag="HCM7373" 



ORIGIN 



Query Match 49.9%; 
Best Local Similarity 98.9%; 
Matches 558; Conservative 



Score 554.4; DB 29; 
Pred. No. 1.2e-102; 
0; Mismatches 6; 



Length 753; 



Indels 



0; Gaps 



Qy 


526 


Db 


1 


Qy 


586 


Db 


61 


Qy 


646 


Db 


121 



ATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGC 
I I I I I I I I I UTM I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGGTGCCCCAGGCTGCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGC 



0; 
585 

.TP.**- 

60 
645 



ACAC G GCT CT TCT CACT CT GT CAT GAAC GCT GGGC AGAT GAC CT CT AT C C CAAGATCT AC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACACGGCTCTTCTCAGTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTAC 12 0 

CACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTAT 705 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTAT 180 



Qy 



706 T T C CAGAT AT T C CGCAAG CT CT GGGG C C GCC AGAT C C CCGGCAC C ACCT CAGC ACT GGT G 765 



Db 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 

181 TTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTG 240 



Qy 766 CGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAG 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 CGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAG 300 

Qy 826 CCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAG 885 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCCCAGCCCCGGGCCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAG 360 

Qy 886 ACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGC 945 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGC 420 

Qy 94 6 GTCCTCAAT'GTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCT 1005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 GTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCT 480 

Qy 1006 GTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCC 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I |J I I I I I I | | | 
Db 481 GTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCC 54 0 

Qy 1066 ATCATCTACAACTTCCTCAGTGGC 1089 

I II I I I I I I I I I I I I I I I I I I I I I 
Db 541 ATCATCTACAACTTCCTCAGTGGC 564 



RESULT 5 
AK048781 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



AK048781 3470 bp mRNA linear HTC 20-SEP-2003 

Mus musculus 0 day neonate cerebellum cDNA, RIKEN full-length 
enriched library, clone : C230065B06 product : OREXIN RECEPTOR TYPE 2, 
full insert sequence. 
AK048781 

AK04 8781. 1 GI: 26339571 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol. 303, 19-44 (1999) 
"99279253 
10349636 
2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carninci,P., 



Konno,H., Akiyama,J., Nishi,K., Kitsunai , T . , Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura, S . , Hazama,M., Nishine,T., Harada,A. , 
Yamamoto,R., Matsumoto, H. , Sakaguchi, S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M. , 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 

FANTOM Consortium. 
TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3470) 

AUTHORS Ada chi, J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci., P . , 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A. , Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A. , Murata,M., 
Nakamura,M., Nishi,K., Nomura, K. , Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A. , Toya,T., Yasunishi,A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct^ Submission 

JOURNAL Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc . riken. go. jp, 
URL :http: //genome. gsc.riken. go. jp/, Tel : 81-45-503-9222 , 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URLrhttp: //genome . gsc . riken. go. jp/ 
URL:http: //fantom. gsc . riken . go . jp/ . 
FEATURES Location/Qualifiers 
source 1. .3470 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db xref="FANTOM DB : C230065B06" 



\ 



CDS 



polyA_signal 
polyA_site 
ORIGIN 



/db_x r e f = "MGI :2415851" 
/db_xre.f="taxon: 10090" 
/clone="C230065B06" 
/tissue_type-" cerebellum" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="0 day neonate" 
75. .1457 

/note="unnamed protein product; OREXIN RECEPTOR TYPE 2 
(SWISSPROTIP56719, evidence: FASTY, 98.5%ID, 100%length, 
match=1380) 
putative" 
/ codon_start=l 
/protein_id="BAC33457 . 1 " 
/db_xref="GI: 26339572" 

/ translation="MSSTKLEDSLSRRNWSSASELNETQEPFLNPTDYDDEEFLRYLW 

REYLHPKEYEWLIAGYIIVFWALIGNVLVCVAWKNHHMRTWNYFIWLSLADVL 

VTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVLTLSCIALDRWYAICHPL 

MFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANKTTLFTVCDEHWGGEV 

YPKMYHICFFLWYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQRKWKQQQPVSQPR 

GSGQQSKARI SAVAAEI KQI RARRKTARMLMWLLVFAI C YLPI S I LNVLKRVFGMFT 

HTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAFSCCLGVHHRQGDR 

LARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTLPAANGAGPLQNWYLQQGVP 

SSLLSTWLEV" 

3455. .3460 

/note= "putative" 

3470 

/note= "putative" 



Query Match 47.0%; Score 521.4; DB 11; Length 3470; 

Best Local Similarity 70.6%; Pred. No. 1.3e-95; 

Matches 712; Conservative 0; Mismatches 291; Indels 6; Gaps 



l; 



Qy 



Db 



80 AT GAAGAT GAGTT T CT C C GCT AT CT GT GGC GT GAT TAT CT GT AC CCAAAACAGTAT GAGT 139 
I II II II II II II II I I I I I I I II II II I I I I III I I I I I I I I 
178 AC GAC GAGGAATT C CT GC GGT AC CT GT GGAGGGAAT AC CT AC AC CC GAAAGAAT AT GAGT 237 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



140 



238 



200 



GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGGTCCTGATCGCAGGGTATATCATCGTGTTCGTTGTGGCTCTCATCGGGAACGTCCTGG 297 



259 



TCT GC CT GGC C GT GT G GC GGAAC CAC CACAT GAG GACAGT CAC CAACTACTT CAT T GT CA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 98 TCTGTGT GGCAGT GTGGAAGAAC CAC CACAT GAGGACAGT CACCAACTACTTCATAGTCA 357 



260 



358 



320 



319 



ACCTGTCCCTGGC^GACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACCTTTCCCTAGCAGATGTGCTTGTGACCATCACCTGCCTTCCAGCTACCCTCGTTGTTG 417 



379 



ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 
I I I I I I II I I I I III I II M II I I I I I I I I I I I I I I I II III I I I I I 
418 AC AT C ACT GAGACT T GGTT CT T T GGAC AGT C C CT CT GT AAGGT CATT C CTT ATTT ACAGA 477 



Qy 



Db 



38 0 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
478 CTGTGTCAGTGTCTGTGTCTGTTCTTACGTTGAGCTGCATTGCCTTGGACCGATGGTACG 537 



Qy 

Db 



440 

538 



CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 

I I I I I I I I I I I I I I I I I I ! I i I I I I I I I I I I I I I I I I I I I I I I 

C CATT T GTC AC CCT T T GAT GT T CAAGAGC AC AGC CAAAC GGGCT C GAAACAGC AT C GT T G 



499 
597 



Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 598 T CAT CT GGAT C GT CT C CT GCAT CATAAT GAT T CCT CAAGC CATT GT CAT GGAGT GCAG CA 657 

Qy 560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I II I I I I I 

Db 658 GCATGCTCCCTGGCCTAGCCAATAAGACCACCCTCTTTACAGTATGTGATGAACACTGGG 717 

Qy 620 CAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCC 679 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I 

Db 718 GCGGT GAAGT T TACC CAAAGAT GT AC CAT AT CTGCTTCTTTCTGGT GACATACAT GGCAC 777 

Qy 680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 

I Ml I M III, I I I I III I I I I I I I I I I I I II I I I II I I I I I I I I I 
Db 77 8 CTCTGTGTCTTATGATATTGGCTTATCTCCAAATATTCCGTA7\ACTCTGGTGCCGACAGA 837 

Qy 740 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

I I I I I I II I I I I I II I I I I I I I I I I I 

Db 838 TTCCCGGAACTTCTTCTGTGGTTCAGAGAAAATGGAAGCAGC AGCAGCCGGTTT 891 

Qy 8 00 ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 859 

II I I I I I II I I I I II I I I I I I I I I I I 

Db 8 92 CTCAGCCCCGGGGGTCCGGACAGCAGAGCAAGGCTCGGATTAGCGCTGTTGCTGCTGAGA 951 

Qy 860 T GAAGCAGAT GCGTGCACGGAGGAAGACAGCCAAGATGCT GATGGTGGTGCT GCT GGTCT 919 

I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I | | || || | | | | | | | 
Db 952 T AAAGC AGAT C C GAG CAC GAAG GAAAACAGC C CGGAT GCT CAT GGT T GTACTT CT GGT CT 1011 

Qy 920 TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 97 9 

I M I I I I I I M II I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I 
Db 1012 TT GCAATTTGCTATCTACCAAT CAGCAT CCT CAAT GTGCTAAAGAGAGTATTTGGGATGT 1071 

Qy 980 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 

I I M I Ml I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1072 T CACACACACGGAAGACAGAGAGACT GTCTATGCTT GGTT CACTTTTT CTCATTGGCTTG 1131 

Qy 1040 T GT AC GC CAACAGC GCT GCCAAC C C CAT CAT CT ACAACTT C CT C AGT GG 1088 

I M I I I I I I I I I I I I I Mill || || || || || || | II I I 
Db 1132 T ATATG CCAAC AGT GCT GC AAAC CCAAT TAT T TAT AAT TT T CT TAGT GG 1180 

RESULT 6 ,. '. - , • 

AK038551 

LOCUS AK038551 3729 bp mRNA linear HTC 19-SEP-2003 

DEFINITION Mus musculus adult male hypothalamus cDNA, RIKEN full-length 

enriched library, clone :A230036M08 product : OREXIN RECEPTOR TYPE 2, 

full insert sequence. 
ACCESSION AK038551 

VERSION - AK038551.1 GI:26332642 
KEYWORDS HTC; CAP trapper. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



Sasaki, N. , Carninci, P. 
. Tashiro,H., Itoh,M., 



Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 
REFERENCE .1 

AUTHORS Carninci, P. and Hayashizaki, Y. 
TITLE High-efficiency full-length cDNA cloning 

JOURNAL Meth. Enzymol. 303, 19-44 (1999) 
MEDLINE 99279253 
PUBMED 10349636 
REFERENCE 2 

AUTHORS Carninci, P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 
TITLE Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 
JOURNAL Genome Res. 10 (10), 1617-1630 (2000) 
MEDLINE 20499374 
PUBMED 11042159 
REFERENCE 3 

AUTHORS Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T. 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A. , 
Yamamoto,R., Matsumoto, H . , Sakaguchi, S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu,M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 

FANTOM Consortium. 
TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3729) 

AUTHORS Adachi, J. , Aizawa, K. , Akimura,T., Arakawa,T., Bono,H., Carninci, P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida, K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M. , 
Koya,S., Kurihara,C, Matsuyama, T . , Kiyazaki, A. , Murata,M., 
Nakamura,M., Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki, Y. , Saito, R. , Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa,A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 



COMMENT 



FEATURES 

source 



CDS 



polyA_signal 



polyA_site 



ORIGIN 



Kanagawa 230-0045, Japan ( E-mail : genome-res@gsc . riken . go . jp, 
URL: http: //genome. gsc. riken. go. jp/, Tel : 81^4 5-503-9222 , 
Fax:81-45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL: http: // genome. gsc. riken. go. jp/ 
URL: http : //fantom. gsc. riken. go. jp/ . 

Location/Qualifiers 

1. .3729 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_x r e f = " FANTOM__DB : A2 3 0 0 3 6M0 8 " 

/db_xref ="MGI : 2402981" 

/db_xref="taxon: 10090" 

/ cl one- " A2 3 0 0 3 6M0 8 " 

/ sex="male" 

/tissue_type="hypothalamus" 

/clone__lib="RIKEN full-length enriched mouse cDNA library" 

/dev_stage="adult" 

76. .1458 

/note="unnamed protein product; OREXIN RECEPTOR TYPE 2 
(SWISSPROTI P56719, evidence: FASTY, 98.5%ID, 100%length, 
match=1380) 
putative" 
/ codon_start=l 
/protein_id-"BAC30039. 1" 
/db_xref ="GI : 26332643" 

/ translation="MSSTKLEDSLSRRNWSSASELNETQEPFLNPTDYDDEEFLRYLW 
REYLHPKEYEWLIAGYIIVFWALIGNVLVCVAWKNHHMRTWNYFIWLSLADVL 
VTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVLTLSCIALDRWYAICHPL 
MFKSTAKRARNSIWIWIVSCIIMIPQAIVMECSSMLPGLANKTTLFTVCDEHWGGEV 
YPKMYHICFFLWYMAPLCLMILAYLQIFRKLWCRQIPGTSSWQRKWKQQQPVSQPR 
G S GQQ S KARI SAVAAE I KQ I RARRKTARMLMWLL VFAI C YL P I S I LNVLKRVFGMFT 
HTEDRETVYAWFTFSHWLVYANSAANPIIYNFLSGKFREEFKAAFSCCLGVHHRQGDR 
LARGRTSTESRKSLTTQISNFDNVSKLSEHWLTSISTLPAANGAGPLQNWYLQQGVP 
SSLLSTWLEV" 
3712. .3717 
/note="putative" 
3729 

/note="putative" 



Query Match 47.0%; 
Best Local Similarity 70.6%; 
Matches 712; Conservative 



Score 521.4; DB 11; 
Pred. No. 1.4e-95; 
0; Mismatches 291; 



Length 3729; 
Indels 6; Gaps 



1; 



Qy 



Db 



80 AT GAAGAT GAGT TT C T C CGCT AT CT GT GGCGT GATT AT CT GT AC C CAAAACAGT AT GAGT 139 

I > M I M I II I II I I I I I I I I 

179 ACGACGAGGAATTCCTGCGGTACCTGTGGAGGGAATACCTACACCCGAAAGAATATGAGT 238 



Qy 



14 0 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

lllllll M Ml II Mill I I I I I 



Db 239 GGGTCCTGATCGCAGGGTATATCATCGTGTTCGTTGTGGCTCTCATCGGGAACGTCCTGG 298 

Qy 200 T CT G C CT GGCCGT GTG GC GGAACCAC CAC AT GAGGACAGTCAC CAACT ACT T CAT T GT CA 259 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db .299 T CT GT GTG GC AGT GT GGAAGAACCAC CACAT GAGGAC AGT CAC CAAC TACT T CAT AGT C A 358 

Qy 260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I 1.1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 359 ACCTTTCCCTAGCAGATGTGCTTGTGACCATCACCTGCCTTCCAGCTACCCTCGTTGTTG 418 

Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 419 AC AT C ACT GAGACT T GGTTCT T T GGACAGT C CCT CT GT AAGGT CATT C CT T ATTT ACAGA 478 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 479 CTGTGTCAGTGTCTGTGTCTGTTCTTACGTTGAGCTGCATTGCCTTGGACCGATGGTACG 538 

Qy 440 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 499 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 539 C C ATTT GT CACCCT T T GAT GT T CAAGAGCACAGC CAAAC GGGCT CGAAACAGCAT C GTT G 598 

Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 599 T CAT CT GGAT C GT C T C CT GCAT CAT AATGAT T C CT CAAGC CATT GT CAT GGAGT GC AGC A 658 

Qy .560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 659 GCATGCTCCCTGGCCTAGCCAATAAGACCACCCTCTTTACAGTATGTGATGAACACTGGG 718 

Qy 620 CAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCC 679 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 719 G C GGT GAAGT T T AC C CAAAGAT GTAC CAT AT CT G CTTCTTTCTGGT GACAT ACAT G GCAC 778 

Qy 680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 779 CTCTGTGTCTTATGATATTGGCTTATCTCCAAATATTCCGTAAACTCTGGTGCCGACAGA 838 

Qy 740 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 839 T T CC C GGAACT TCT T CT GT GGT T CAGAGAAAAT GGAAGC AGC AGCAGCCGGTTT 892 

Qy 8 00 ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 859 

II I I I I I II I II I III I I I I I I I I I I 

Db 893 CTCAGCCCCGGGGGTCCGGACAGCAGAGCAAGGCTCGGATTAGCGCTGTTGCTGCTGAGA 952 

Qy 860 T GAAGCAGAT GCGT GCAC G GAGGAAGACAGCCAAGAT GCT GAT GGT GGT GCT GCT GGT CT 919 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I 
Db 953 TAAAGC AGAT C CGAGCAC GAAGGAAAAC AG C C C GGAT G CT CAT GGTT GT ACT T CT GGT CT 1012 

Qy 920 TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 979 

I II I I I I I I II II I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I 

Db 1013 TTGCAATTTGCTATCTACCAATCAGCATCCTCAATGTGCTAAAGAGAGTATTTGGGATGT 1072 

Qy 980 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 

II M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 1073 T CACACACACG GAAGACAGAGAGACT GT CT AT GCT T GGTT CACT T TT T CT CAT T GGCTT G 1132 



Qy 

Db 



1040 T GTAC GC CAACAGC G CT GC CAACC CCAT CAT CTACAACT T C CT CAGT GG 1088 

I II I I I I I I I I I I I I I I I I I I II II II II II II I I I I I 
1133 TAT AT GC CAACAGTGCTGCAAACCCAATT ATTTAT AATTTT CTTAGTGG 1181 



RESULT 7 
BC035858 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC035858 1790 bp mRNA linear HTC 04-MAR-2003 

Homo sapiens, Similar to hypocretin (orexin) receptor 2, clone 
IMAGE: 5767576, mRNA. 
BC035858 

BC035858.1 GI:23959160 
HTC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1790) 
Strausberg, R. 
Direct Submission 

Submitted ( 31- JUL-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC) , 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact: nisc_mgc@nhgri.nih.gov 

Akhter,N. , Ayele, K. , Becks trom-Sternberg, S .M. , Benjamin, B. , 
Blakesley, R.W. , Bouf f ard, G. G . , Breen,K., Brinkley,C, Brooks, S., 
Dietrich, N. L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., 
Maduro,Q. L. , Masiello, C. , Maskeri, B. , Mastrian, S . D. ,McCloskey, J. C. , 
McDowell, J., Pearson, R. , Stantripop, S . , Thomas, P. J., Touchman, J. W. , 
Tsurgeon,C, Vogt,J.L., Walker, M. A., Wetherby, K. D . , Wiggins, L., 
Young, A., Zhang,L.-H. and Green, E.D. 



Clone distribution: MGC clone distribution information 'tfan be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 79 Row: p Column: 14 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 6006037 
This clone has the following problem: retained intron. 
FEATURES Location/Qualifiers 
source 1. .1790 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 5767576" 



/tissue_type="Brain, fetal, whole pooled" 
/ clone_lib="NIH_MGC_12 1 " 
/lab_host="DH10B" 
/note="Vector: pCMV-SP0RT6" 

ORIGIN 



Query Match 46.1%; Score 511.8; DB 11; Length 1790; 

Best Local Similarity 70.0%; Pred. No. 9e-94; 

Matches 706; Conservative 0; Mismatches 297; Indels 6; Gaps 1 

Qy 80 AT GAAGAT GAGT T T CT C CG CTAT CT GT GGC GT GATT ATCT GT AC C CAAAACAGT AT GAGT 139 

I II M M M I I II II I I I I I I I II II III I I I I III I I I I II I I 
Db 146 ACGACGAGGAATT CCT GCGGTACCTGT GGAGGGAATACCT GCACCC GAAAGAATAT GAGT 205 

Qy 140 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M ill I I I I 

Db 206 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 265 

Qy 200 TCTGCCTGGCCGT GT GGCGGAAC CAC CAC AT GAGGACAGT CAC CAACT ACTT C ATT GT CA 259 

I M I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I 
Db 266 TT T GT GT GGCAGT GTGGAAGAAC CAC CAC AT GAGGAC GGTAAC CAACTACT T CATAGT CA 325 

Qy 260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I II M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 326 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 385 



Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 386 ATATCACTGAGACCTGGTTTTTT GGACAGT CCCTTT GCAAAGT GATTCCTT AT CTACAGA 445 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCT7^CTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I III I II II II II I I I I III II I I I I || I I I I I M 
Db 44 6 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 505 

Qy 4 40 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 499 

I I I I I I I I I I I -I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
Db 506 CAAT CT GT CAC C CTTT GAT GT TT AAGAGC ACAGCAAAGC GGGC C C GTAACAGC AT T GT CA 565 



Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 566 T CAT CT GGAT T GT CT CCTGC ATT AT AAT GATT CCT CAGGC CAT C GT C ATGGAGT GC AGCA 625 

Qy 560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I I I I I I I I I I I I I I II Mill I I I I I I I II II I I I I I 

Db 626 CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 685 

Qy 620 CAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCC 679 

I III I I II I I I II II I II I I I II II II I I I I I II II III MM I 
Db 686 GT G GT GAAAT T T AT CC CAAGATGT AC CAC AT CTGTTTCTTTCTGGT GACAT ACAT G GC AC 745 



Qy 680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 

M I M I II II II I I I II II I I II I II II Mill I II I I I I II II II 
Db 746 CACT GT GT CT CAT GGT GT TGG CT TAT CT GCAAAT ATT T CGCAAACT CT GGT GT C GAC AGA 805 

Qy 74 0 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

II II II I I I'll I I I I I II I II II I I II II I I I 

Db 806 T CCCT GGAACAT CAT CT GT AGTT CAGAGAAAAT GGAAGCC C C TGCAGCCTGTTT 859 



Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



800 



860 



860 



920 



920 



980 



ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTGCTGGCTGAAG 859 

II I Ml I I I I I I I I I I I I I I I I I I I I I 

CACAGCCTCGAGGGCCAGGACAGCCAACGAAGTCCCGGATGAGCGCTGTGGCGGCTGAAA 919 

TGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCT 919 
I I I I I I I I I II II I I I I I I I II I I I II I I I I I I I I I Mill I I I I I 

TAAAGCAGAT C C GAGCCAGAAGGAAAACAGC C C GGAT GT T GAT GGT T GT GCT T T T GGTAT 979 

TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 979 
I II I I I I I I II II II III I I I I I I I I II II Mill II II I I I I I I I 

T T GCAATT T GCT AT CTAC CAAT TAGC AT C CT CAAT GT GCT AAAGAGAGT ATTT GGGATGT 1039 



98 0 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 
I Ml I. I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I 

104 0 TT GC C CAT ACT GAAGACAGAGAGACT GT GT AT GC CT GGT T T AC CTT T T CACACT GGCTT G 1099 

1040 TGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTCAGTGG 1088 

I II I I I I I II I I I I I II II II II II II II I I I I I I I I 
1100 TAT AT GCCAATAGT GCTGC GAATCCAATTATTTATAATTTTCTCAGT GG 1148 



RESULT 8 
AK079572 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



AK079572 3153 bp mRNA linear HTC 19-SEP-2003 

Mus musculus adult male hypothalamus cDNA, RIKEN full-length 
enriched library, clone :A230091E19 product : OREXIN RECEPTOR TYPE 2, 
full insert sequence. 
AK079572 

AK079572.1 GI:26348079 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y. 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carninci,P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A. , 
Yamamoto,R., Matsumoto, H . , Sakaguchi, S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M. , Ohara,E., Watahiki,M. , 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki , Y. 



TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 
JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 
MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

TITLE Functional annotation of a full-length mouse' cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 
Group Phase I & II Team. 

TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 

JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 3153) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida,K. , Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,!., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T . , Miyazaki,A., Murata,M., 
Nakamura,M. , Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A. , Takahashi, F. , Takaku-Akahira, S .', 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A. , 
Muramatsu,M. and Hayashizaki, Y. 

TITLE Direct Submission 

JOURNAL Submitted ( 16-APR-2002 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome-res@gsc . riken. go . jp, 
URL :http: //genome. gsc. riken. go. jp/, Tel : 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URLrhttp: //genome . gsc . riken. go. jp/ 
URLrhttp: / / fantom. gsc . riken . go . jp/ . 
FEATURES .. Location/Qualifiers 

source 1. .3153 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xre f = " FANTOM_DB : A2 3 0 0 9 1E1 9 " 

/db_x re f = "MGI :2403517" 

/db_xref="taxon: 10090" 

/clone="A230091E19" 

/sex="male" 

/ tissue_type="hypothalamus " 

/clone_lib=" RIKEN full-length enriched mouse cDNA library" 



/dev_stage="adult" 
CDS 108. .1202 

/note="unnamed protein product; OREXIN RECEPTOR TYPE 2 
(SWISSPROTI P56719, evidence: FASTY, 98.5%ID, 100%length, 
match=1380) 
putative " 
/codon_start=l 
/protein_id="BAC37688. 1" 
/db_xref="GI : 26348080" 

/translation="MSSTKLEDSLSRRNWSSASELNETQEPFLNPTDYDDEEFLRYLW 
REYLHPKEYEWVLIAGYIIVFWALIGNVTjV^ 

VTITCLPATLWDITETWFFGQSLCKVIPYLQTVSVSVSVLTLSCIALDRWYAICHPL 
MFKSTAKRARNS I WIWIVSCI IMI P Q AI VMEC S SML P G LAN KTT L FT VC D EHWGG EV 
YPKMYHICFFLVTYMAPLFIJyill^YLQIFRKLWCRQIPGTSSWQRKWKQQQPVSQPR 
GS GQQS KARVSAVAAEI KQI RARRKTARMLMWLLVFAI CYLPI S I LNVL KRVFGMFT 
HT EDRETVYAWFT FPHWLVYANS CCKPN YL " 

ORIGIN 

Query Match 46.0%; Score 510.4; DB 11; Length 3153; 

Best Local Similarity 70.5%; Pred. No. 2.2e-93; 

Matches 712; Conservative 0; Mismatches 291; Indels 7; Gaps 2; 

Qy 80 AT GAAGAT GAGT T TCT C C G CTAT CT GT GGC GT GATT ATCT GT AC GCAAAACAGT AT GAGT 139 

I MM M II II II II IIMII I I I I I II I I I I III I II II I I I 
Db 211 ACGACGAGGAATT CCTGCGGTACCTGT GGAGGGAATACCTACACCCGAAAGAATAT GAGT 270 

Qy 140 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 271 GGGTCCTGATCGCAGGGTATATCATCGTGTTCGTTGTGGCTCTCATCGGGAACGTCCTGG 330 

Qy 200 TCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTCACCAACTACTTCATTGTCA 259 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 331 T CT GT GT G GCAGT GT GGAAGAAC CAC CAC ATGAG GACAGT CAC CAACTACT T C AT AGT CA 390 

Qy 260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 391 ACCTTTCCCTAGCAGATGTGCTTGTGACCATCACCTGCCTTCCAGCTACCCTCGTTGTTG 450 

Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I III I II II II I I I I I I I I I I I I I I I II III I I II I 
Db 451 ACATCACTGAGACTTGGTTCTTTGGACAGTCCCTCTGTAAGGTCATTCCTTATTTACAGA 510 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 511 CTGTGTCAGTGTCTGTGTCTGTTCTTACGTTGAGCTGCATTGCCTTGGACCGATGGTACG 570 

Qy 44 0 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 499 

I I I I II I II I I I I I I I I I I I I I I I I I II I II I I I I I I I 

Db 571 CCATTTGTCACCCTTTGATGTTCAAGAGCACAGCCAAACGGGCTCGAAACAGCATCGTTG 630 

Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 631 T CAT CT GGAT C GT CTC CT GC AT CAT AAT GAT T C CT CAAGC C AT T GT CAT G GAGT GCAGCA 690 

Qy 560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I I I I I I I I I I I II Mill I I I I I I I I I I I I I I I I I 

Db 691 GCAT GCT CC CT G GC CTAGC CAAT AAGAC CAC C CT CT TTACAGT AT GT GAT GAAC ACT G GG 750 



Qy 

Db 



62 0 C AGAT GAC CTCT AT C C CAAGAT CT AC C AC AGT TGCTTCTT TAT T GT CAC C TAC CT GGC C C 679 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
751 GCGGTGAAGTTTACCCAAAGATGTACCATATCTGCTTCTTTCTGGTGACATACATGGCAC 810 



Qy 


680 


CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 

J III II I'll llil III llll 1 1 1 1 1 1 1 1 11 llllll (III 1141 

1 1 M M Ml 1 1 I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 II 1 llll 

CTCTGTTTCTTATGATATTGGCTTATCTCCAAATATTCCGTAAACTCTGGTGCCGACAGA 


739 


Db 


811 


870 


Qy 


740 


TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 
1 1 1 1 1 1 1 1 llll 1 1 1 1 I I I I | | | | I 1 1 1 1 1 1 1 
T T C CC G GAACTT CTT CT GT GGT T CAGAGAAAAT GGAAGC AGC AGCAGCCGGTTT 


799 


Db 


871 


924 


Qy 


800 


ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 

II MM 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 
CTCAGCCCCGGGGGTCCGGACAGCAGAGCAAGGCTCGGGTTAGCGCTGTTGCTGCTGAGA 


859 


Db 


925 


984 


Qy 


860 


TGAAGCAGAT GC GT GCACGGAGGAAGACAGC CAAGAT GCT GAT GGT GGT GCTGCT GGT CT 
1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 llllll llllll 1 1 1 1 1 II M llllll! 
TAAAGCAGATCCGAGCACGAAGGAAAACAGCCCGGATGCTCATGGTTGTACTTCTGGTCT 


919 


Db 


985 


1044 


Qy 

Db 


920 
1045 


TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 
1 II 1 1 II 1 1 1 1 II 1 1 1 II 1 1 1 1 | || | | | | | | | | | | | || | | | | | | | | | 
TTGCAATTTGCTATCTACCAATCAGCATCCTCAATGTGCTAAAGAGAGTATTTGGGATGT 


979 
1104 


Qy 

Db 


980 
1105 


TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 
M M 1 llllll 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 II 1 
T C ACACAC AC GGAAGACAGAGAGACT GT CT AT GCTT GGTT CACT TT T CCT CAT T GGCT T G 


1039 
1164 


Qy 


1040 


TGTACGCCAACAGC-GCTGCCAACCCCATCATCTACAACTTCCTCAGTGG 1088 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II II II II M I I 1 1 1 
TATATGCCAACAGCTGCTGCAAACCCAATTATTTATAATTTTCTTAGTGG 1214 




Db 


1165 





RESULT 9 

CF147830/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Craniata; Vertebrata; Euteleostomi ; 
Catar rhini ; Hominidae ; Homo . 



CF147830 790 bp mRNA linear EST 25-JUL-2003 

AGENCOURT_14740202 NIH_MGC_145 Homo sapiens cDNA clone 
IMAGE: 6971889 5' , mRNA sequence. 
CF147830 

CF147830.1 GI:33244098 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates, - 
1 (bases 1 to 790) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Daniela S. Gerhard, Ph.D. 
Office of Cancer Genomics 
National Cancer Institute / NIH 
Bldg. 31 RmlOA07 Bethesda, MD 20892 
Email: cgapbs-r@mail.nih.gov 
'Tissue Procurement: GPCR Consortium 

cDNA Library Preparation: GPCR Consortium 



cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: IRBI02 row: a column: 08 
High quality sequence start: 7 
High quality sequence stop: 738. 
FEATURES Location/Qualifiers 
source 1. .790 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clones "IMAGE: 6971889" 

/tissue_type="mixed" 

/lab_host="DH10B" 

/clone_lib= M NIH_MGC_145" 

/note="Vector : pcDNA3.1; Site_l: varies by clone; Site_2 : 
varies by clone; ORFs were PCR- amplified and cloned into 
pcDNA3.1 by the GPCR Consortium. Cloning sites vary by 
clone and include the following: 5 * -EcoRV-XmnI/XhoI-3 ' , 
S'-EcoRV-Xmnl/Notl-S', EcoRV (TA cloned, non-directional). 
For information about which gene each clones represents, 
please visit our anonymous ftp site at 

ftp : / /image . llnl . gov/image/rearrayed_plates/IRBI . preSV. dat 
a Note: this is a NIH_MGC Library." 

ORIGIN 



Query Match 45.1%; Score 500.6; DB 14; Length 790; 

Best Local Similarity 99.2%; Pred. No. 1.2e-91; 

Matches 503; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 



Qy 


583 


CGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATC 


642 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


790 


C GCAC AC GGCT CT T CT CAGTCT GT GAT GAAC GCT GGGCAGAT GAC CT CTAT C C CAAGAT C 


731 


Qy 


643 


TACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCC 


702 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I 




Db 


730 


TACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCC 


671 


Qy 


703 


TATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTG 


762 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


670 


TATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTG 


611 


Qy 


763 


GTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGA 


822 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 TM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


610 


GTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGA 


551 


Qy 


823 


GAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGG 


882 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1.1 I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 




Db 


550 


GAGCCCCAGCCCCGGGCCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGG 


491 


Qy 


883 


AAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATC 


942 






M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I | | | | | | | | | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 




Db 


490 


AAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATC 


431 


Qy 


943 


AGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAA 


1002 



Db 

Qy 

Db 

Qy 

Db 



I 1 1 I I I 1 1 I I I I I I I 1 1 I I I I I I I I I I I I 1 1 I I I I l I 1 1 I I I I I I I I 1 1 I I I I I I I | i M 

430 AGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAA 371 

1003 GCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAAC 1062 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

370 GCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAAC 311 

1063 C C CAT CAT C T AC AAC T T C C T C AGT G G C 1089 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 

310 CC CAT CAT CT ACAACT T CCT CAGT GG C 284 



RESULT 10 

AY420886 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



gene 



ORIGIN 



AY420886 750 bp DNA linear GSS 17-DEC-2003 

Pan troglodytes HCRTR1 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
AY420886 

AY420886.1 GI:39776843 
GSS. 

Pan troglodytes (chimpanzee) 
Pan troglodytes 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostoml; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

1 (bases 1 to 750) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Ke jariwal, A. , 
Todd, M. A. , Tanenbaum, D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp -mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 750) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Ke jariwal, A. , 
Todd, M. A. , Tanenbaum, D.M. , Civello, D. R. , Lu, F. , Murphy, B. , 
Ferriera,S., Wang,G., Zheng, X.H., White, T. J., Sninsky, J. J. , 
Adams, M.D. and Car gill, M. 
Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence as made by sequencing genomic exons and ordering them 
based on alignment. 

Location/ Qualifiers 

1. .750 

/organism="Pan troglodytes" 

/mol_type=" genomic DNA" 

/db__xref="t axon: 9598" 

<1. .>750 

/gene="HCRTRl" 

/locus_tag="HCM7373" 



Query Match 42.1%; Score 467.2; DB 29; Length 750; 

Best Local Similarity 83.7%; Pred. No. 7.6e-85; 

Matches 472; Conservative 0; Mismatches 92; Indels 0; Gaps 



0; 



Qy 526 ATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGC 585 

I I I I I I I I I I I I I I I I I I I I I I I I III I M I I I I I I I I I I I I I I I I I I I I ! I I I I I I I 

Db 1 ATGGTGCCCCAGGCTGCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGC 60 

Qy 586 AC ACGGCT CTT CT C ACTCT GT CAT GAACGCT GGGCAGAT GAC CT CTAT C C CAAGAT CTAC 645 

I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 AC ACGGCT CTT CT C AGT CT GT GAT GAACGCT G G GC AGAT GAC CT C TAT C C CAAGAT CTAC 120 

Qy 646 CACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTAT 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTAT 180 

Qy 706 TTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTG 7 65 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 TTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTG 240 

Qy 766 CGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAG 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 CGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAG 300 

Qy 826 CCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAG 8 85 

I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 301 CCCCAGCCCCGGGCCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCGCGGAGGAAG 360 

Qy 886 ACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTOCTACCTCCCCATCAGC 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGT 42 0 

Qy 946 GTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCT 1005 

I I I I I I I I I I I I I I II I I I I 

Db 421 GTCCTCAATGTCCTTAAGAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 480 

Qy 1006 GTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCC 1065 

I I I I I I I I I I I I I I I I I I I 

Db 481 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACGCCAACAGNNNNGCCAACCCC 54 0 

Qy 1066 AT CATCT ACAACTTCCTCAGT GGC 1089 

I II I I I I I I I I I I I I I I I I I I 
Db 541 AT CANNNAC AACT T C C T CAGT GGC 564 



RESULT 11 

AY420887 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
• ORGANISM 



REFERENCE 
AUTHORS 



AY420887 726 bp DNA linear GSS 17-DEC-2003 

Mus musculus HCRTR1 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
AY420887 

AY420887. 1 GI : 39776844 
GSS. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
1 (bases 1 to 726) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal, A. , 
Todd, M. A. , Tanenbaum, D.M. , Civello, D. R. , Lu, F. , Murphy, B. , 
Ferriera,S., Wang,G., Zheng, X.H., White, T . J. , Sninsky, J. J . , 



TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 

COMMENT 

FEATURES 

source 



gene 



Adams , M.D. and Cargill,M. 

Inferring nonneutral evolution from human- chimp-mouse orthologous 
gene trios 

Science 302 (5652), 1960-1963 (2003) 
14671302 

2 (bases 1 to 726) 

Clark, A. G., Glanowski, S . , Nielson,R., Thomas, P., Kejariwal , A. , 
Todd, M. A., Tanenbaum,D.M. , Civello, D . R. , Lu,F., Murphy, B., 
Ferriera,S., Wang,G., Zheng, X.H., White, T.J,, Sninsky, J. J. , 
Adams, M.D. and Cargill,M. 
Direct Submission 

Submitted ( 16-NOV-2003 ) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 

This sequence as made by sequencing genomic exons and ordering them 
based on alignment. 

Location/Qualifiers 

1. .726 

/organism="Mus musculus" 
/mol_type=" genomic DNA n 
/db_xref="taxon: 10090" 
<1. .>726 
/gene="HCRTRl" 
/locus_tag="HCM7373" 



ORIGIN 



Query Match 41.7%; 
Best Local Similarity 87.0%; 
Matches 508; Conservative 



Score 462.4; DB 29; 
Pred. No. 7.2e-84; 
0; Mismatches 76; 



Length 726; 
Indels 0; 



Gaps 



0; 



Qy 



Db 



526 ATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGC 585 
I M I I I M I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I Ml 
1 ATGGTGCCCCAGGCTGCTGTCATGGAGTGCAGCAGCGTGCTGCCTGAGCTAGCCAATCGC 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



586 ACAC GG CTCT T CT CACT CT GT CAT GAAC GCT GG G CAGAT GAC CT CT AT C C CAAGAT CTAC 645 

M MINI I I I I I I I I I I I I I I I I I I | I I I I I I I I I I II I 

61 ACCCGGCTCTTCTCTGTCTGTGATGAGCACTGGGCAGATGAACTCTACCCCAAGATCTAT 120 

64 6 CACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTAT 705 

I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 CACAGCTGCTTTTTCATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCTATGGCCTAT 180 

706 TTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTG 765 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || M I I I I I I I I I I I I I 

181 TTCCAGATCTTCCGCAAGCTCTGGGGCCGCCAGATCCCTGGTACCACATCAGCCTTGGTG 240 

766 CGGAACTGGAAGCGCCCGTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAG 825 

II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I 

241 CGGAACTGGAAACGGCCCTCGGAACAACTGGAGGCTCAGCACCAGGGCCTCTGTACAGAG 300 



Qy 



Db 



826 CCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAG 885 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I | I II I I I I I I I I I 

301 CCCCAGCCCCGGGCCCGAGCCTTCCTGGCTGAGGTGAAGCAGATGCGAGCTCGGAGGAAG 360 



Qy 

Db 



886 ACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGC 945 

M M I I I I I I I I I I II I I II MINIM II I I II I I I II II I I I I I I II 
361 ACGGCTAAGATGCTGATGGTAGTCCTGCTGGTTTTTGCACTCTGTTATCTGCCCATCAGT 420 



Qy 94 6 GTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCT 1005 

- I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I N I I I I I I I I Mill I M I I 

Db 421 GTCCTCAATGTCCTTAAGAGAGTGTTCGGGATGTTCCGCCAAGCCAGCGACCGGGAAGCC 480 

Qy 1006 GTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCC 1065 

M M M M M M M M M M M M M M M I I M M M M M M M M M M I M I 

Db 481 GTCTACGCCTGCTTCACCTTCTCCCACTGGCTAGTGTACG'CCAACAGTGCCGCCAACCCT 54 0 



Qy 1066 ATCATCTACAACTTCCTCAGTGGCCTTCCCTGGAGTCTGCTCTA 1109 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I 

Db 541 AT CAT CTACAACTTCCT CAGTGGCAAATT CCGGGAGCAGTT CAA 584 



RESULT 12 
AL535838 

LOCUS AL535838 1001 bp mRNA linear EST 12-MAY-2003 

DEFINITION AL535838 Homo sapiens FETAL BRAIN Homo sapiens cDNA clone 

CS0DF013YE04 5-PRIME, mRNA sequence. 
ACCESSION AL535838 

VERSION AL535838.2 GI: 30542758 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1001) 

AUTHORS Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
TITLE Full-length cDNA libraries and normalization 

JOURNAL Unpublished (2001) 
COMMENT On Feb 13, 2001 this sequence version replaced gi : 12799331. 

Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 151. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi -bin/cluster. cgi?seq=CS0DF013BC02QPl&cluster=151. r. Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DF013BC02QP1 . 
FEATURES Location/Qualifiers 
source 1. .1001 

/organism="Homo sapiens" 
/ mol_t y"pe= "mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DF013YE04" 
/tissue_type="FETAL BRAIN" 
Vdev_jstage=" fetal" 
/clone_lib="Homo sapiens FETAL BRAIN" 

/note="Organ: brain; Vector: pCMVSPORT_6; 1st strand cDNA 
was primed with a Notl-oligo (dT) primer. Five prime end 
enriched, double-strand cDNA was digested with Not I and 
cloned into the Not I and EcoRV sites of the pCMVSPORT 6 
vector. Library was not normalized." 

ORIGIN 



Query Match 41.7%; Score 462.4; DB 9; 

Best Local Similarity 91.0%; Pred. No. 8.3e-84; 
Matches 538; Conservative 13; Mismatches 31; 



Length 1001; 
Indels 9; Gaps 



6; 



Qy 377 AGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGT 436 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I I 
Db 415 AGGCTGTGTCCGTGTCANTGGCAGTGCTAACTCTMANCTTCATCGCMCTGGACCGCTGGT 474 

Qy 437 ATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCC 496 

I I I I : I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 475 ATGCHATCTGCYACCCACTATTGTTCAAGARCACAGCCCGGCGGGCCCGTGGCTCCATCC 534 

Qy 497 TGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCA 556 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 
Db 535 TNNGNATCTGGGCTNTNTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGGAATGCA 594 

Qy 557 . GCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCT 616 

I I I I I I I II I I I I I I I til I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 595 GCAGTGTGCTGCCTNAGCTANCCAACCGCACACGGCTCTTCTCAGTCTGTGATGAACGCT 654 

Qy 617 GGGCAGAT GAC CT CT AT C C CAAGAT CTAC CACAGT T GCT T CT T TAT T GT CAC CT ACCT GG 676 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I | j | | | | | | 
Db 655 GGGCAGAT GAC CTCTATC C C AAGATCTAC CACAGT T GCTT CT TT AT T GT CAC CTAC CTGG 714 

Qy 677 CCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCC 736 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 715 CCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCC 774 

Qy 737 AGAT C C CC GGCAC CAC CT CAGCACT GGTGC GGAAC T GGAAGC GCCC CT C AGACCAGCT - G 795 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db" 775 AGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGG 834 

Qy 796 GGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCT 855 

II III I I II II II I II I I I I II III! I II I I I I I Illllll 

Db 835 GGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCC-- AGCCCCGGGCCGCGCCTTCCTGGCT 892 

Qy 856 GAAGT GAAGCAGATGC GTGCAC GGAGGAAGACAGC CAAGAT GCT GAT GGT GGT GCT G CT G 915 

MMIII : I I II I : I I I I I : I : I I I I I I II I I I I I I I I : I I I I I I I 

Db 893 GAAGT GA RCAGAT GST GCAGGCAGVAGACAS C S AAGATGCT GAT GGT GGB GCT GCT G 94 9 

Qy 916 GTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGG 966 

MMIII 1111111111:1 1111111:11 || | | | | | | | | 

Db 950 STCTTCG-CCTCTGCTACST-GCCATCAGSGT-CTCAATGTCTTAAAGAGG 997 

RESULT 13 
BM926746 

LOCUS BM926746 993 bp mRNA linear EST 12-MAR-2002 

DEFINITION AGENCOURT_6681991 NIH_MGC_121 Homo sapiens cDNA clone IMAGE : 5767576 

5', mRNA sequence. 
ACCESSION BM926746 

VERSION BM926746.1 GI : 19377125 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;, 



Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 993) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r @mail . nih . gov 
Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http : / /image . llnl . gov 

Plate: LLAM12826 row: a column: 17 
High quality sequence stop: 684. 
FEATURES Location/Qualifiers 
source 1. .993 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone="IMAGE: 5767576" 

/lab_host="DH10B" 

/clone_lib="NIH_MGC_121" 

/note="0rgan: brain; Vector: pCMV-SP0RT6; Site_l: NotI; 
Site_2: EcoRV (destroyed); RNA source anonymous pool of 3 
fetal brains, female age 20 weeks, female age 24 weeks, 
and male age 26 weeks. Library is oligo-dT primed and 
directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
0.7-3.5 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 017. Note: 
this is a NIH_MGC Library." 

ORIGIN 

Query Match 34.7%; Score 385.2; DB 12; Length 993; 

Best Local Similarity 71.4%; Pred. No. 4.6e-68; 

Matches 507; Conservative 0; Mismatches 203; Indels 0; Gaps 0; 



Qy 8 0 AT GAAGATGAGTTT CTCCGCTATCTGTGGCGT GATTATCT GTACCCAAAACAGTAT GAGT 139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 5 AC GAC GAGGAATT C CTGC GGT AC CT GT GGAG G GAAT AC CT GCAC C CGAAAGAATAT GAGT 204 

Qy 14 0 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I II I I I I I I [ i I I I I I I I I I I I I | | | | M I 

Db 2 05 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 264 

Qy 200 TCTGCCTGGCCGTGT GGC GGAAC C AC C ACAT GAGGACAGT C AC C AACT ACT T CAT T GT C A 259 

I II I I I I I I I I I I 11111111111111111111 I I I II I I II I I I I I I I II 
Db 265 T T T GT GT GGC AGT GT GGAAGAAC C AC CAC AT GAGGACGGT AAC CAACT AC T T C AT AGT CA 324 

Qy 2 60 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 325 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 384 

Qy 32 0 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 



Db 



I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II || I I I I I I I I I 

385 AT AT CACT GAGAC CT G GT T T T TT G GACAGT C CCTT T GCAAAGT GAT T C CTT AT CTACAGA 444 



Qy 


380 


Db 


445 


Qy 


440 


Db 


505 


Qy 


500 


Db 


565 


Qy 


560 


Db 


625 


Qy 


620 


Db 


685 


Qy 


680 


Db 


745 


Qy 


740 


Db 


805 



CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 
I I I I I I I I I I I Ml I II II II II I I I I III II I I I I II I I I I I I I 
CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 504 

CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 4 99 

I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I Ml | 

CAAT CT GT C AC CCT TT GAT GT T T AAGAGCAC AGCAAAGC GG GC CCGT AACAGCAT T GT C A 564 

GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

T CAT CT GGAT T GTCT C CT GC AT T ATAAT GAT T C CT C AGGCCAT CGT C AT GGAGT GCAGC A 624 

GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

Ml I II I I I II I I I I I I I I I I I I I I I I I I I I I 

CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 684 

CAGAT GACCT CT AT C C CAAGAT CTAC CACAGT T GCT T CT TT ATT GTC AC CTAC CT GGC C C 67 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTGGTGAAATTTAT CCCAAGATGTACCACAT CT GTTT CTTTCT GGTGACATACAT GGCAC 744 

CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 
I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I II I I II I I I I 
CACT GT GTCT CAT GGTGTTGGCTT AT CTGCAAAT ATT TCGCAAACTCTGGTGTCGACAGA 804 



I I I I I I I I I I I I I I I I I I I I I I I 



I I I 



I I I 



RESULT 14 

BX119589 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



BX119589 543 bp mRNA linear EST 10-FEB-2003 

BX119589 Soares infant brain 1NIB Homo sapiens cDNA clone 
IMAGp998P20171 ; IMAGE: 40608, mRNA sequence. 
BX119589 

BX119589. 1 GI: 28289997 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 543) 
Ebert,L., Heil,0., Hennig,S 
Radelof,U., Schneider, D. and Korn,B 
Human UnigeneSet - RZPD3 
Unpublished (2003) 
Contact: Ina Rolfs 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Im Neuenheimer Feld 580, D-69120 Heidelberg, Germany 
RZPD; IMAGp998P20171. 

RZPDLIB; I.M.A.G.E. cDNA Clone Collection; 
Human UnigeneSet - RZPD3 (RZPDLIB No. 972) 
http: //www. rzpd.de/CloneCards/cgi- 

bin/showLib.pl. cgi/response?libNo=972 Contact: Ina Rolfs 



Neubert,P., Partsch,E.,. P'e'ters^M., 



RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14059 Berlin, Germany 
Tel: +49 30 32639 101 
Fax: +49 30 32639 111 
www. rzpd . de 

This clone is available royalty-free from RZPD; 

contact RZPD (clone@rzpd.de) for further information. Seq primer: 
M13u, Primer sequence: CGTTGTAAAACGACGGCCAGT . 
FEATURES Location/Qualifiers 
source 1. .543 

/organism="Homo sapiens" 
/mol_t ype= "mRNA" 
/db_xref="taxon: 9606" 

/clone="IMAGp998P20171 ; IMAGE: 40608" 
/sex="female" 

/dev_stage="73 days post natal" 
/lab_host="DH10B (ampicillin resistant)" 
/clone_lib="Soares infant brain 1NIB" 

/note="Organ: whole brain; Vector: Lafmid BA; Site_l: Not 
I; Site_2: Hind III; 1st strand cDNA was primed with a Not 
I - oligo(dT) primer [5 f 

AACTGGAAGAATTCGCGGCCGCAGGAATTTTTTTTTTTTTTTTTT 3 1 ] ; 
double-stranded cDNA was ligated to Hind III adaptors 
(Pharmacia), digested with Not I and directionally cloned 
into the Not I and Hind III sites of the Lafmid BA vector. 
Library went through one round of normalization. Library 
constructed by Bento Soares and M.Fatima Bonaldo." 

ORIGIN 



Query Match 33.9%; Score 376.2; DB 13; Length 543; 

Best Local Similarity 99.0%; Pred. No. 2.4e-66; 

Matches 378; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 



Qy 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


158 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


217 


Qy 


61 


TCCC CT GT G CCT C CAGACTAT GAAGAT GAGTT T CT C C GCT AT CT GT GGC GT GAT T AT CT G 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 




Db 


218 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGCGATTATCTG 


277 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 




Db 


278 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


337 


Qy 


181 


CTG^TGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGA^AGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


338 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


397 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


398 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


457 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 M I'l II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I | | | | | | | | | | Ml 




Db 


458 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGNAAG 


517 



Qy 361 GT CAT C C C CTAT CTACAGGCT G 382 

I I I II I I I I I H I I I I I I I I 
Db 518 GT CAT CCCCTATCTACAGACAG 539 



RESULT 15 
CF147829 

LOCUS CF147829 788 bp mRNA linear EST 25-JUL-2003 

DEFINITION AGENCOURT_l 47402 10 NIH_MGC_145 Homo sapiens cDNA clone 

IMAGE: 6971890 5', mRNA sequence. 
ACCESSION CF147 829 

VERSION CF147829.1 GI: 33244097 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 788) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished (1999) 
COMMENT Contact: Daniela S. Gerhard, Ph.D. 

Office of Cancer Genomics 
National Cancer Institute / NIH 
Bldg. 31 RmlOA07 Bethesda, MD 20892 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: GPCR Consortium 
cDNA Library Preparation: GPCR Consortium 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : / / image . llnl . gov 
Plate: IRBI02 row: a column: 09 
High quality sequence start: 9 
High quality sequence stop: 772. 
FEATURES Location/Qualifiers 
source 1. .788 

/organism="Homo sapiens" 

/mol_type= "mRNA" 

/db_xref="taxon:9606" 

/clone-"IMAGE: 6971890" 

/tissue_type="mixed" 

/lab_host="DH10B" 

/clone_lib-"NIH_MGC_145" 

/note="Vector: pcDtiA3.1; Site_l: varies by clone; Site_2 : 
varies by clone; ORFs were PCR- amplified and cloned into 
pcDNA3.1 by the GPCR Consortium. Cloning sites vary by 
clone and include the following: 5 T — EcoRV— Xmnl/Xhol — 3 1 , 
S'-EcoRV-Xmnl/Notl-S' , EcoRV (TA cloned, non-directional). 
For information about which gene each clones represents, 
please visit our anonymous ftp site at 

ftp : //image . llnl . gov/image/rearrayed_plates/IRBI . preSV. dat 
a Note: this is a NIHJVIGC Library." 

ORIGIN 



Query Match 



32.3%; Score 359; DB 14; Length 788; 



Best Local Similarity 72.0%; Pred. No. 9.2e-63; 

Matches 467; Conservative 0; Mismatches 182; Indels 0; Gaps ,0 

Qy 80 ATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTGTACCCAAAACAGTATGAGT 139 

I II M M M M M II I I I I I I I II II III MM III I II I II II 

Db 140 ACGACGAGGAATTCCTGCGGTACCTGTGGAGGGAATACCTGCACCCGAAAGAATATGAGT 199 

Qy 14 0 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

II I I M I I II M I II I I I i I II I II I II II II I II I II I M I 

Db 200 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 259 

Qy 200 T CT G CCT GGC C GTGT GGC GGAACCAC CAC AT GAGGACAGT CACCAACT ACTT CATT GT CA 259 

I M I II I II II I I I M I II I I I II II I II I I II I I I II II II II I II II II 
Db 2 60 T TT GTGT GGCAGT GT GGAAGAAC CAC CAC AT GAGGAC GGTAAC CAACTACT T CAT AGT CA 319 

Qy 2 60 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I Ml Ill I I II I II I I II II 

Db 320 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 379 

Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I II I II I Mill I II II II I II I I I I II II II M I II II II M 
Db 38 0 AT AT CAC C GAGACCT GGTT TTT T GGACAGT CC CT TT GCAAAGT GATT C CT TAT CT ACAGA 439 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I M I I Mill III I II II II II I II I III II MM II II I II I I 
Db 440 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 4 99 

Qy 440 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 499 

I I I I I I II I I I I II M M I I II I II II II II II I I M I III I 

Db 500 CAAT CT GT CAC C CT T T GAT GTT T AAGAGC ACAGCAAAGCGGGC C C GT AAC AGCATT GT C A 559 

Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

M II I I I I I I I I I I I I I I II I I || I I I I I I I I I I I I I I 

Db 560 TCATCTGGATTGTCTCCTGCATTATAATGATTCCTCAGGCCATCGTCATGGAGTGCAGCA 619 

Qy 560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

M M I I I I I II II II II Mill I I I II I I II I I I I II I 

Db 620 CTGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 679 

Qy 620 CAGAT GACCTCTATCCCAAGAT CTACCACAGTT GCTTCTTTATT GTCACCTACCT GGCCC 679 

I I I I I I I II I I II I I I I I I II I I I I I I II I I I I I II I M I I II I 
Db 680 GT GGT GAAATT TAT C C CAAGAT GTAC CACAT CT GTTT CTTT CT GGT GACAT ACAT GGC AC 739 

Qy 680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTG 72 8 

I M II I I II I II I I I I I I I I I I I II I II I I I I I I I 

Db 740 CACT GTGT CT CAT GGT GTT GGCT T ATCT GN CAAT AT T T CGCANACTCT G 788 



Search completed: October 15, 2004, 22:50:30 
Job time : 3232.12 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 15, 2004, 13:54:41 ; Search time 4497.38 Seconds 

(without alignments) 
10697.520 Million cell updates/sec 

US-10-070-532-3 
1110 

1 atggagccctcagccacccc ttccctggagtctgctctaa 1110 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters: 



6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : 



GenEmbl : * 1 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



gb_ba : * 
gb_htg: * 
gb_in: * 
gb_om : * 
gb_ov: * 
gb pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb__ro : * 
gb_sts:* 
gb_s y : * 
gb_un : * 
gb__vi : * 
em_ba : * 
em_fun : * 
em_hum: * 
em_in : * 
em_mu : * 
em_om : * 
em_or : * 
em_o v : * 
emjpat : * 
em_ph : * 
em_pl : * 
em__ro : * 
em sts:* 



28 


» SIU 


un : * 


29 


i cm 


vi i * 


30 


* em 


H"t~rr Vmnri • + 
u 11 urn . 


O J. 


t em 


nig mv : 




* sin 


htg other : * 


o o 


» em 


htg mus : * 


O 1 


em 


ntg pin : * 


O O 


em 


nug rod : ^ 


o D 


em 


htg mam: * 


37 


em 


htrr vrt - • * 

11 L,U V -L » 


38 


em 


sy : * 


39 


em 


htgo hum:* 


40 


em_ 


htgo mus : * 


41 


em_ 


htgo other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



Jo . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1110 


100. 


0 


1110 


6 


BD185453 


BD185453 Human neu 


2 


1102 


99. 


3 


1110 
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AR216118 


AR216118 Sequence 
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1079 . 4 


97. 
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1209 


6 
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AR216117 Sequence 
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1079 . 4 


97. 
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1564 
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E43974 Novel G pro 
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1079 . 4 


97. 
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1564 
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E50810 Novel G pro 
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E50811 Novel G pro 
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1079.4 
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1564 


6 


AX299475 


AX299475 Sequence 
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1079.4 


97. 
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1564 


6 


AX549082 


AX549082 Sequence 


10 


1079.4 


97. 


2 


1564 


6 


AX746121 


AX746121 Sequence 


11 


1079.4 


97. 


2 


1564 


6 


AX840912 


AX840912 Sequence 


12 


1079.4 


97. 


2 


1564 


9 


AF041243 


AF041243 Homo sapi 


13 


1078.4 


97. 


2 


1133 


6 


E43973 


E43973 Novel G pro 


14 


1078.4 


97. 


2 


1133 


6 


AX746120 


AX746120 Sequence 


15 


1078.4 


97. 
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1170 


6 


E43972 


E43972 Novel G pro 


16 


1078.4 


97. 


2 


1170 


6 


AX746118 


AX746118 Sequence 


17 


1077.8 


97. 


1 


1209 


6 


BD185452 


BD185452 Human neu 


18 


1075.2 


96. 


9 


1116 


6 


AR216119 


AR216119 Sequence 
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1075.2 


96. 


9 


1133 


6 


BD185454 


BD185454 Human neu 


20 


1074.6 


96. 


8 


1278 


6 


AX280925 


AX280925 Sequence 


21 


883.4 


79. 


6 


2200 


10 


AY336083 


AY336083 Mus muscu 


22 


881 


79. 


4 


2469 


10 


AF041244 


AF041244 Rattus lio 


23 


688 


62. 


0 


843 


6 


AR109899 


AR109899 Sequence 


24 


661 


59. 


5 


789 


6 


AR109632 


AR109632 Sequence 


25 


661 


59. 


5 


789 


6 


E12154 


E12154 cDNA encodi 


26 


661 


59. 


5 


789 


6 


AR300942 


AR300942 Sequence 


27 


571.8 


51. 


5 


781 


10 


AF394596 


AF394596 Mus muscu' 


28 


545.6 


49. 


2 


3114 


10 


AF041246 


AF041246 Rattus no 


29 


521.4 


47. 


0 


1545 


10 


AY336084 


AY336084 Mus muscu 


30 


521.4 


47. 


0 


2117 


10 


AY336085 


AY336085 Mus muscu 


31 


511.8 


46. 


1 


1633 


6 


E33974 


E33974 cDNA clone 


32 


511.8 


46. 


1 


1843 


6 


AX549084 


AX549084 Sequence 


33 


511.8 


46. 


1 


1843 


6 


AX840914 


AX840914 Sequence 
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AF4 99612 




O O 

z y o . z 
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637 
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AF394 59 / 


4 U 


z / o . z 


O A 

24 . 
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o o o 

3zo 


4 


AB0924 88 


41 


261 . 6 


23 . 


6 


501 


4 


AF532967 


42 


241.2 


21. 


7 


344 


9 


F202078S03 


43 


241.2 


21. 


7 


9785 


6 


AR178605 


44 


241.2 


21. 


7 


9785 


6 


AX088174 


45 


241.2 


21. 


7 


9785 


9 


AY062030 



AF041245 Homo sapi 
AX280927 Sequence 
AF164626 Canis fam 

AY255599 Mus muscu 
AF499612 Ovis arie 

AF394597 Mus muscu 
AB092488 Bos tauru 
AF532967 Ovis arie 
AF202080 Homo sapi 
AR178605 Sequence 
AX088174 Sequence 
AY062030 Homo sapi 



ALIGNMENTS 



RESULT 1 
BD185453 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



linear PAT 17-JUN-2003 



FEATURES 



ORIGIN 



BD185453 1110 bp DNA 

Human neuropeptide receptor. 
BD185453 

BD185453. 1 GI: 31877 653 

JP 2002360288-A/2. 

unidentified 

unidentified 

unclassified. 

1 (bases 1 to 1110) 

Soppet,D.R., Li,Y. and Rosen, C. A. 

Human neuropeptide receptor 

Patent: JP 2002360288-A 2 17-DEC-2002; 

HUMAN GENOME SCIENCES INC 

OS Unidentified 

PN JP 2002360288-A/2 

PD 17-DEC-2002 

PF 02-MAY-2002 JP 2002130838 

PI DANIEL R SOPPET,YI LI, CRAIG A ROSEN 

PC C12N15/09, A61K31/7088,A61K38/00,A61K45/00,A61K48/00, A61P3/04, 
PC A61P3/06, 

PC A61P3/10,A6lP9/10,A61P9/12,A61P25/08, A61P25/18, A61P25/22, PC 
A61P25/28, 

PC A61P35/00,A61P43/00, C07K14/705, C07K16/24 , C12N1/15, C12N1/19, PC 
C12N1/21, 

PC C12N5/10,C12Ql/68, C12N15/00, C12N5/00, A61K37/02 CC 

Strandedness : Single; 

CC Topology: Linear; 

CC Human neuropeptide receptor 

FH Key Location/Qualifiers 

FT source 1. .1110 

FT /organism= 1 Unidentified 1 . 

Location/Qualifiers 
1. .1110 

/ organism="unidentif ied" 
/mol_type=" genomic DNA" 
/db xref="taxon: 32644" 



Query Match 100.0%; Score 1110; DB 6; Length 1110; 

Best Local Similarity 100.0%; Pred. No. 2.6e-210; 

Matches 1110; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TCCCCTGT GC CT C C AGACT AT GAAGAT GAGT T T CT C C GCT AT CT GT GGC GT GAT TAT CT G 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I 
Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I M I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 C C C CT GGACC GCT G GT AT GC C AT CT GC CAC C C ACT AT T GT T C AAGAGCAC AG CCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 C C CCT GGACC GCT GGTAT GCCAT CT GC C AC C CAC TAT T GT T CAAGAGCAC AGC C C GGC G G 4 80 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CT CT GT CAT GAAC GCT GGGCAGAT GACCT CT AT C C CAAGAT CT AC C ACAGT TGCTTCTTT 660 

- ill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ill I I I I I I 
Db 601 CT CT GT CAT GAACGCT GGGCAGAT GACCT CTAT C C CAAGAT CT AC C ACAGT TGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | I I I I I I I I I I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 78 0 



Qy 



781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 7 81 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 CGCGCCTT CCTGGCT GAAGT GAAGCAGAT GCGTGCACGGAGGAAGACAGCCAAGAT GCTG 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I II I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 2 
AR216118 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AR216118 1110 bp DNA 

Sequence 3 from patent US 6410701. 
AR216118 

AR216118.1 GI: 23314431 

Unknown . 

Unknown. 

Unclassified. 

1 (bases 1 to 1110) 

Soppet,D.R., Li, Y. and Rosen, C. A. 

Human neuropeptide receptor 

Patent: US 6410701-A 3 25-JUN-2002; 

Location/ Qualifiers 

1. .1110 

/ organism="unknown n 
/mol_type=" genomic DNA" 



linear PAT 25-SEP-2002 



Query Match 99.3%; 
Best Local Similarity 99.5%; 
Matches 1105; Conservative 



Score 1102; DB 6; 
Pied. No. le-208; 
0; Mismatches 5; 



Length 1110; 



Indels 



0; Gaps 



0; 



QY 
Db 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I 

1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



Qy 

Db 



61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 TCCCCTGTGCCTC CAGAC TAT GAAGAT GAGT T T CTC C GCTAT CT GT GGC GT GATT AT CT G 120 



Qy 



121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






M 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


O A 1 
Z H. 1 


ALLAAC1AC1 1 CATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Db 


241 


1 M 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I I I | | | |'| | | | | | | | | | | | | | | | | | | | | 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


oUl 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


obi 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


A O 1 

4z 1 


C C C CT GGAC C GCT GGTAT G C C AT CT GC C AC C C ACT ATT GT T CAAGAGCACAGC C C GG C G G 


480 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


A O 1 

4 81 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 




GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






M 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


G C AGT CAT GGAAT GCAGC AGT GT GCT G C CT GAGCTAGCCAAC C G CACAC GGCT CTT C T CA 


600 


Qy 


oUl 


CT CT GT C AT GAAC GC T GGGC AGATGAC CT CT AT C C CAAGAT CT AC C ACAGTT GCTT CTTT 


660 






Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1.1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


GT CT GT GAT GAAC GCT GGGCAGAT GAC CT C TAT C CCAAGAT CT AC C ACAGT T GCTT CT T T 


660 


Qy 


DDI 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 




AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


1 Q 1 
1 0 1 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


yy 




CGCGCCTTCCTGGCT GAAGT GAAGCAGAT G C GT GC AC GGAG G AAG AC AGC CAAGAT GCT G 


900 






1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GC AC GGAGGAAGACAGC CAAGATGCT G 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I | | 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | I I I I I I I I I I I I I 

Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

Qy 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 3 
AR216117 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AR216117 1209 bp DNA 

Sequence 1 from patent US 6410701. 
AR216117 

AR216117. 1 GI: 23314430 

Unknown. 

Unknown. 

Unclassified. 

1 (bases 1 to 1209) 

Soppet,D.R., Li,Y. and Rosen, C. A. 

Human neuropeptide receptor 

Patent: US 6410701-A 1 25-JUN-2002; 

Location/Qualifiers 

1. .1209 

/ organism="unknown" 
/mol_type=" genomic DNA" 



linear PAT 25-SEP-2002 



Query Match 97.2%; 
Best Local Similarity 99.4%; 
Matches 1083; Conservative 



Score 1079.4; DB 6; 
Pred. No. 3.1e-204; 
0; Mismatches 6; 



Length 1209; 
Indels 0; Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | 
ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 12 0 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I | | | | | | | | | | | | | | | | | | | | | | | | | | 

tacccaaa/^cagtatgagtgggtcctcatcgcagcctatgtggctgtgttcgtcgtggcg ;i 8 0 

CT GGT G GGCAAC AC G CT GGT CT GCCT GGC C GT GT G G C GGAACCACCAC ATGAG GACAGT C 2 4 0 

M III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | | | | | | | | | | | | | | | | | M I I 

CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 



Qy 



301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 



.Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 




O D -L 


Li LAI LLCC1A1 LIACAGGC1 Gl Gl CCG1 GTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






i [ 1 1 1 t i i i i i i i i i i i i i i i i i t i i i t i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i 
I I ' 1 1 1 1 1 1 1 M M 1 M 1 1 1 1 1 M 1 1 M 1 1 1 1 1 II II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


yy 




ULCulGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






I I I I i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i 
N 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I | | | | | | | | | || | | M | | | | | | | | | | | 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


yy 


A Q 1 


GLCCbl GGO 1 CCA1 CC1 GGGCArCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 N M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I I I I 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


yy 




GLAG1 CAI GCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






i ( t i i i i i t lit i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 M 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 M | | | | | | | | | | | || U 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


DU1 


CT CT GT CAT GAAC GCT GGGCAGAT GAC CT CT AT C C CAAGAT CT AC C AC AGT T GCT T CTTT 


660 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ■ i i i i 

M 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | || | | | | | | | | | 




Db 


601 


GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CT AT C C CAAGAT CTAC C AC AGT TGCTTCTTT 


660 


Qy 


DDI 


AI I GI CACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






i i i i i i i i i i i i i i i i i i i r i i i i i i i i i i i i i i ■ i i i i i i i ■ i i * i ■ i i 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M I I I | I | | | | | | | | | | | | | | | | | | | | | | | | | 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCT'CATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


yy 


/ Z 1 


AAGC1 CI GGGGOCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


yy 


/ O 1 


CLCI CAGACLAGCI GGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 i 1 1 1 1 i 1 1 1 1 1 1 i 1 1 l l f l i i i 
1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


wy 


O 4 ± 


CGCGCL 1 GGC I GAAGTGAAGCAGATGCGTGCACG GAG GAAGACAGC CAAGAT GCTG 


900 






i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i ■ 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


841 


CGCGCCTT C CTGGCT GAAGTGAAGCAGAT GCGT GCACGGAGGAAGACAGC CAAGATGCTG 


900 


yy 


q m 


AJ. GG1 GG1 GG1 CI I CGCCC1 LI GCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 l l l l l I I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


960 


yy 


Q £1 


AAbAbbb 1 G 1 1CGGGA1 Gl TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i 
1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | || | | | | | | | | | | | | | | 




Db 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 


Qy 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1080 






1 1 1 1 1 H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | I I I I I | | | | | | | | | | | | | | | | | | | | || M 




Db 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1080 


Qy 


1081 


CTCAGTGGC 1089 




Db 


1081 


1 1 1 1 1 1 1 1 1 

CTCAGTGGC 108 9 





RESULT 4 
E43974 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



E43974 1564 bp 

Novel G . protein-coupled receptor 
E43974 

E43974.1 GI:18625173 

JP 2000106888-A/3. 

unidentified 

unidentified 

unclassified. 

1 (bases 1 to 1564) 

Bergsma, D.J. and Ellis, C.E. 

Novel G protein-coupled receptor 



DNA linear 
(HFGAN72Y) . 



PAT 31-JAN-2002 



(HFGAN72Y) 



Patent: JP 2000106888-A 
SMITHKLINE BEECHAM CORP 
OS 
PN 
PD 
PF 
PR 
PI 



3 18-APR-2000; 



Unidentified 
JP 2000106888-A/3 
18-APR-2 000 

21-JUL-1999 JP 1999206116 
30-APR-1997 US 08/846705 
DERK J BERGSMA, CATHARINE ELIZABETH ELLIS 



PC 



C12N15/09,A61K38/00,A61K38/00,A61K45/00,A61K48/00,A61P1/00, PC 
A61P1/14, 

PC A61P9/02,A6lP9/04,A61P9/10,A61P9/12,A61Pll/06,A61P13/02, PC 
A61P13/08, 

PC A61P19/10,A61P25/14,A61P25/16,A61P25/18,A61P25/22,A61P25/24, 
PC A61P31/04, 

PC A61P31/10, A61P31/12,A6lP31/18, A61P33/00, A61P35/00, A61P37/08, 
PC A61P43/00, 

PC C07K14/705, C07K16/28, C12N1/21, C12N5/10, C12P21/02, G01N33/566, 
PC G01N33/577// 

PC C12P21/08, (C12N15/09,C12R1:91), (C12P21/02 , C12R1 : 91 ) , C12N15/00, 
PC A61K37/02, 

PC A61K37/02,C12N5/00, (C12N15/00, C12R1 : 91) 
CC Strandedness : Single; 
CC Topology: Linear; 

FH Key Location/Qualifiers 
FT source 1. .1564 

FT /organism= 'Unidentified 1 . 

Location/Qualifiers 
1. .1564 

/ organism="unidentif ied" 
/mol_type= f, genomic DNA" 
/db xref="taxon: 32644" 



Query Match 97.2%; Score 1079.4; DB 6; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 
Matches 1083; Conservative 0; Mismatches 6; 



Length 1564; 



Indels 



0; Gaps 



0; 



Qy 



Db 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 



Qy 



Db 



61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 



Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I M! I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 334 CTGGTGGG CAACAC GCTGGTCTGCCTGGCCGTGT GGCGGAAC C AC CAC AT GAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I M I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | II | | | | | | | | 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 57 3 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I MINI 

Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 754 GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CT AT C C CAAGAT CT AC CAC AGT TGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 934 CC CT C AGAC CAGCT GGG GGACCTGGAGC AG G G C CT GAGTGGAGAG C C C C AGC C C C GG GGC 993 

Qy 841 CGCGCCTTCCT GGCT GAAGT GAAGC AGAT GC GT GC ACGGAGGAAGAC AG C CAAGAT GCT G 900 

Ml I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 



Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I I I I I I I I I I I I 



Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 108 9 

I I I I I I I II 
Db 1234 CTCAGTGGC 1242 



RESULT 5 

E50810 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



E50810 1564 bp DNA linear PAT 18-JUN-2001 

Novel G protein-bound receptor (HFGAN 72X) . 

E50810 

E50810.1 GI:13023197 

JP 2000060578-A/l. 

unidentified 

unidentified 

unclassified. 

1 (bases 1 to 1564) 

Derk,J.B. and Catharine , E. E. 

Novel G protein-bound receptor (HFGAN 72X) 

Patent: JP 2000060578-A 1 29-FEB-2000; 

SMITHKLINE BEECHAM CORP 

OS Unidentified 

PN JP 2000060578-A/l 

PD 29-FEB-2000 

PF 21-JUL-1999 JP 1999206115 

PR 30-APR-1997 US 08/846704 

PI DERK J BERGSMA, CATHARINE ELIZABETH ELLIS 

PC C12N15/09,A61K31/70,A61K38/00,A61K39/00,A61K39/395,A61K39/395, 
PC A61K45/00, 

PC A61K48/00, A6lP3/04,A61P9/00,A61Pll/06,A6lPl3/00, A61P25/00, PC 
A61P25/16, 

PC A61P25/18,A61P25/20,A61P25/22,A61P31/04, A61P31/10, A61P31/12, 
PC A61P31/18, 

PC A61P35/00,A61P37/00,C07Kl4/705, C12N5/10, C12P21/02, C12Q1/02, PC 
G01N33/53, 

PC G01N33/566//C07Kl6/28,C12N15/00,A61K37/02, C12N5/00 CC 
Strandedness : Single; 
CC Topology: Linear; 

FH Key Location/Qualifiers 
FT source 1. .1564 

FT /organism^ 'Unidentified 1 . 

Location/Qualif iers 
1. .1564 

/organism="unidentified" 
/mol_type=" genomic DNA" 
/db xref="taxon: 32644" 



Query Match 97.2%; . Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204;. 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 



0; 



wy 




•"--L vjvj^-\kjv^v^v^ i ^/\^^^/^L*^^^/\VaLr^Lr^^L-/\<a/-\.I bbrbrobjl LLLLLLJ. b Kj b Ab L Ab Ab Ab CCb 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l i i 

■ 1 I 1 I I I II II 1 1 11 1 1 1 1 1 M 1 1 II 1 i 1 II II 1 1 1 IE 1 1 1 II II 1 1 II 1 ! 1 MINIMI 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 M M M M M M M 1 M M 1 M M 1 M M 1 M M M M M M M 1 1 M M M 1 




Db 


214 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


273 


wy 


191 


1 ALLLAAAALAb 1 Al bAbl bbbl LL1 LAI LbLAbLLl ATGTGGCTGTGTTCGTCGTGGCC 
1 I I 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t i i i i i i i l i i i i i t i i i i i i i i i i i i i i i i i i i i i 


180 


Db 


274 


M I I I 1 II 1 1 1 M 1 1 1 1 I M 1 1 1 1 1 M M 1 M 1 1 1 1 1 1 1 M 1 M M M M M M M M M 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


wy 


1 ft 1 

J. o ± 


i bbi yj i^biUAALAL bbl bbl b J. bLL 1 bbbLb I b 1 bbbbbAALLALLACATGAGGACAGTC 


240 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 M 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 II II II II II II II M M 1 | M M | | | | | | | | | | | | | | | | 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


wy 


9 d 1 
Z *i J. 


AbbAAblALl l bAi 1 bl LAALLlbi LLL 1 bbL I GACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

I 1 1 1 1 1 1 M 1 M II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


wy 


^ni 


LLbbbLAbbLl bbl bbl bbALAl LALIGAbl LLTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






j i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


wy 




blLAlLLLLl AI LIALAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 i 1 i 1 l l l l l l l l l l l l i i i i i i i i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 I I I M 




Db 


514 


GT CAT CCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTT CATC 


573 


wy • 


4Z J. 


LLLL1 LLALLGL1 GG1 AI GCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


.480 






1 1 1 1 1 1 M M II 1 1 1 f 1 1 1 II 1 1 1 1 1 II M 1 1 1 M 1 1 II 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


574 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


633 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


634 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


693 


wy 


Oil 


bbAbl LAI LLAA1 LLAGLALi GI GL 1 GLCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 
i i i i j \ i i i ill i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i \ i i i 


600 


Db 


694 


1 1 1 1 1 1 1 1 1 III 1 1 1 II 1 1 1 1 1 1 1 I I I I I I | | | | | | | | | | | | | | | | || | M | | | | | | | 

GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


753 


wy 


fin i 

DUX 


LI LI Gl LAlLrAALLLl bGGLAGArGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
M 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I M 1 1 1 1 1 




Db 


754 


GT CT GT GAT GAAC G CT GGGC AGAT GAC CT CT AT C C CAAGAT CTACCACAGT TGCTTCTTT 


813 


wy 


DDI 


Al Ibl LALLlALLi GGLLLLALI GGGCLTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 i 1 1 i i i i i i i i i ■ .1 i i i i j i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i t i i i i i « 
1 1 1 1 1 1 1 M 1 1 II i 1 1 1 1 1 II 1 1 1 | | | | | | | | | | | | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I 




Db 


814 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


873 


wy 


791 


AAbrU ibl bbbbbbbbbAbAl LLLLGGLALLALL 1 LAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 i 1 1 1 1 1 1 1 1 i 1 ! 1 1 1 1 1 t 1 I 1 i 1 l l l l l l l l i l l i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I | M M 1 1 1 1 1 1 1 1 1 1 1 1 II M 




Db 


874 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


933 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


ft 4 n 






1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I | I | M | | 




Db 


934 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


993 


Qy 


841 


CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 


900 



Db 



I I I I I I I I I I I I I I I I I I > M > M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

994 CGCGCCTT CCT GGCT GAAGT GAAGCAGAT GCGTGCAC GGAGGAAGACAGCCAAGATGCTG 1053 



Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I 

Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

QY 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I II I I I I I I I I II I II I I I I I I I | | | | I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1234 CTCAGTGGC 1242 



RESULT 6 
E50811 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



PAT 18-JUN-2001 



E50811 1564 bp DNA linear 

Novel G protein-bound receptor (HFGAN 72X) . 
E50811 

E50811.1 GI:13023198 
JP 2000060578-A/2. 
unidentified 
unidentified 
unclassified. 
1 (bases 1 to 1564) 
Derk, J.B. and Catharine, E. E . 
Novel G protein-bound receptor (HFGAN 72X) 
Patent: JP 2000060578-A 2 29-FEB-2000; . 
SMITHKLINE BEECHAM CORP 
OS Unidentified 
PN JP 2000060578-A/2 
PD 29-FEB-2000 
PF 21-JUL-1999 JP 1999206115 
PR 30-APR-1997 US 08/846704 
PI DERK J BERGSMA, CATHARINE ELIZABETH ELLIS 

PC C12N15/09,A61K31/70,A61K38/00,A61K39/00,A61K39/395,A6lK39/395, 
PC A61K45/00, 

PC A61K48/00,A61P3/04, A61P9/00, A61P11/06, A61P13/00, A61P25/00, PC 
A61P25/16, 

PC A61P25/18,A61P25/20,A61P25/22,A6lr 31/04 , A61P31/10, A61P31/12, 
PC A61P31/18, 

PC A61P35/00,A61P37/00,C07K14/705,C12N5/10,C12P21/02, C12Q1/02, PC 
G01N33/53, 

PC G01N33/566//C07K16/28,C12N15/00,A61K37/02,C12N5/00 CC 
Strandedness : Single; 
CC Topology: Linear; 

FH Key Location/Qualifiers 
FT source 1. .1564 

FT /organism^ 1 Unidentified 1 . 

FEATURES Location/Qualifiers 
source 1. .1564 



/organism="unidentif ied" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 32644" 

ORIGIN 

Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 

Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 T C CC CT GT GC CT C C AGAC TAT GAAG AT GAGT T T CT C C GCT AT CT GT GGC GT GATT AT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGGCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GT CAT C C C CT AT CT AC AGG CT GT GT CC GT GTCAGT GGCAGT GCT AACT CT CAGCTTCAT C 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 C C C CT G GAC C GCT GGT AT GC C AT CT GC C AC C CACT AT T GT T CAAGAGC AC AG CCCGGCGG 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTGATGCAATCCAGCAGT GT GCT GCCTGAGCT AGCCAACCGCACACGGCT CTTC3;CA 600 

I M I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

Ml I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 754 GT CT GT GATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTT GCTT CTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



721 



874 



781 



934 



841 



AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I * 

AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGC'CCCAGCCCCGGGGC 993 



900 



CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 



901 



960 



ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

1081 CTCAGTGGC 1089 

I I I I I I I I I 
1234 CTCAGTGGC 1242 



RESULT 7 
AX299473 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX299473 1564 bp DNA linear PAT 26-NOV-2001 

Sequence 1 from Patent EP1154019. 

AX299473 

AX299473.1 GI:17129230 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae ; Homo. 

1 

Bergsma, D. J. and Ellis, C.E. 
G-protein coupled receptor (hfgan72x) 
Patent: EP 1154019-A 1 14-NOV-2001; 
SmithKline Beecham Corporation (US) 

Location/Qualifier^ 

1. .1564 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db xref="taxon: 9606" 



Query Match 97.2%; 
Best Local Similarity 99.4%; 
Matches 1083; Conservative 



Score 1079.4; DB 6; 
Pred. No. 3e-204; 
0 ; Mismatches 6; 



Length 1564; 
Indels 0; Gaps 



0; 



Qy 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 T C C C CT GT GC CT CCAGACT AT GAAGAT GAGT T T CT C C GCT AT CTGT GGCGT GATT AT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 27 3 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GC AGT CAT G C AAT C C AGCAGT GTGCTGCCT GAGCT AGC CAAC C GC ACAC GGCT CT T CT C A 600 

I I I I I M I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CT CT GT CAT GAAC G CT GGGCAGAT GACCT CT AT C C CAAGAT CT AC CACAGT TGCTTCTTT 660 

IMM I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 754 GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CT AT C C CAAGAT CT AC CACAGT T GCTT CTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I M I I I M I I I ! I I I I I I I I I I I I I M I I I I I I I I I I ! I I I I I I I I I I I 

Db .814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCC^iGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

M I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGCGCCTTCCT GGCT GAAGT GAAGCAGATGCGT GCAC GGAGGAAGACAGC CAAGAT GCT G 900 
I I I I I M I I I M I I I I I Ml I I I I I I I I I I I I I || | | | | | | | | | | | | | | M | || | 



Db 



994 CGCGCCTTCCT G GCTGAAGT GAAG C AGATGCGT G C AC GGAGGAAGACAGCCAAGAT GCT G 1053 



Qy 


901 


Db 


1054 


Qy 


961 


Db 


1114 


Qy 


1021 


Db 


1174 


Qy 


1081 


Db 


1234 



ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 
I I I I I ! I I I II II I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I I I I I I I I I 
ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 



I.I I I I I I I I 



RESULT 8 
AX299475 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX299475 
Sequence 3 
AX299475 
AX299475. 1 



1564 bp 
from Patent EP1154019. 

GI:17129231 



DNA 



linear PAT 26-NOV-2 001 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Bergsma,D.J. and Ellis, C.E. 
G-protein coupled receptor (hfgan72x) 
Patent: EP 1154019-A 3 14-NOV-2001; 
SmithKline Beecham Corporation (US) 

Location/Qualifiers 

1. .1564 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db xref="taxon:9606" 



Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 154 AT GGAG CCCT C AGC C AC C C C AGGGGC C C AGAT GG GGGT C C CC C CT GGCAG C AGAGAGC C G 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 



181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 
I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | 

334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 54 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | I I I I I II I I I I I I I I I 
514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | M I I I I 
694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
754 GT CT GT GAT GAACGCT GGGCAGATGACCTCT AT CCCAAGAT CTACCACAGTTGCTTCTTT 813 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I II I I I I I II I I I I I I I I I I I I I II II I I I I I I I I I I I I I II I I I I I I I I I | | | | | | | | 
814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I 
934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

841 CGCGCCTTCCT G GCT GAAGT GAAGC AGAT GC GT GC ACGGAGGAAGAC AGCCAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | M I I I I I I I I I I I I I I I I 
994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | 
1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

111.4 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 



Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 

Db 1234 CTCAGTGGC 1242 



RESULT 9 
AX549082 

LOCUS AX549082 1564 bp DNA linear PAT 26-NOV-2002 

DEFINITION Sequence 367 from Patent WO02061087. 
ACCESSION AX549082 

VERSION AX549082.1 GI:25813851 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Burmer,G.C, Roush,C.L. and Brown, J. P. 

TITLE Antigenic peptides, such as for G protein-coupled receptors 

(GPCRs), antibodies thereto, and systems for identifying such 
antigenic peptides 
JOURNAL Patent: WO 02061087-A 367 08-AUG-2002; 
Lifespan Biosciences, Inc. (US) 
FEATURES Location/Qualifiers 
source 1. .1564 

/organism="Homo sapiens" 
/mol__type="unassigned DNA" 
/db_xref="taxon: 9606" 

ORIGIN 

Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I | | | | | | M I I I I I I I I I I 

ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | | | | | 
TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | 
ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 



Qy 


l 


Db 


154 


Qy 


61 


Db 


214 


Qy 


121 


Db 


274 


Qy 


181 


Db 


334 


Qy 


241 


Db 


394 



Qy , 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 54 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I | M I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I 

Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGT CAT GCAAT C CAGCAGT GTGCTGCCTGAGCTAGCCAACCGCACACGGCT CTT CT CA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CT CT GT CAT GAAC GCT GG GCAGAT GAC CT CT AT CCCAAGAT CT AC C ACAGTT GCT T CT T T 660 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINIMI 

Db 754 GT CT GT GAT GAAC GCT GGG CAGAT GAC CT CT AT CCCAAGAT CT AC C ACAGT TGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I M I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I | | | | I I I I I I I I I I I I I 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I | | | 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

QY 841 C GCGCCTT CCTGGCT GAAGT GAAGCAGAT GCGTGCACGGAGGAAGACAGCCAAGAT GCT G 900 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 994 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GCAC G GAG GAAGAC AGC CAAGAT G CT G 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I M I I I I I I I I I I I I I I I I II I I I I II I I I 111111111111111111111111 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy " 1081 CTCAGTGGC 1089 

I I I I I II II 
Db 1234 CTCAGTGGC 1242 



RESULT 10 
AX746121 

LOCUS AX746121 1564 bp DNA linear PAT 12-JUN-2003 

DEFINITION Sequence 4 from Patent EP1156110. 
ACCESSION AX746121 

VERSION AX746121.1 GI:31744927 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Bergsma,D.J. and Ellis, C.E. 

TITLE G-protein coupled receptor (HFGAN72Y) 

JOURNAL Patent: EP 1156110-A 4 21-NOV-2001; 

SMITHKLINE BEECHAM CORPORATION (US) 
FEATURES Location/Qualifiers 
source 1. .1564 

/organism-"Homo sapiens" 
/mol_type="genomic DNA" 
/db__xref="taxon: 9606" 
/note="HGS EST 554692" 

ORIGIN 

Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 154 ATGGAGC'CCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I || I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

QY 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTcir'CTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 



Qy 

Db 



361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I | | I I I I I I I I I I I I I I I I I I I I I I 

514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I II I I Ml I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | 
694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

601 CT CT GT CAT GAACGCTGGGCAGAT GACCTCTAT CCCAAGAT CTACCACAGTTGCTT CTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

754 GT CT GT GAT GAAC GCT GG G CAGAT GAC CT CT AT C C CAAGAT CT AC C ACAGTT GCTTCTTT 813 



661 



814 



721 



ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 



AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 



841 



994 



901 



CGCGCCTTCCT GGCT GAAGT GAAG C AGATGC GT GC AC GGAGGAAGACAGCCAAGAT GCT G 900 

I M I I I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 



ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

1081 CTCAGTGGC 1089 

I I I II I I I I 
1234 CTCAGTGGC 1242 



RESULT 11 

AX840912 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 



AX840912 1564 bp 

Sequence 8 from Patent WO03075945. 
AX840912 

AX84 0912. 1 GI: 39979051 
Homo sapiens (human) 



DNA 



linear 



PAT 16-DEC-2003 



ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 

AUTHORS Eulenberg,K. , Steuernagel , A. , Haeder,T. and Broenner.G. 

TITLE Cg8327, cgl0823, cgl8418, cgl5862, cg3768, cgll447 and cg!6750 

homologous proteins involved in the regulation of energy 

homeostasis 

JOURNAL Patent: WO 03075945-A 8 18-SEP-2003; 

DeveloGen Aktiengesellschaf t fuer entwicklungsbiologische; 
Forschung (DE) 
FEATURES Location/Qualifiers 
source 1. .1564 

/organism= H Homo sapiens" 
/mol_type="unas signed DNA" 
/db_xref="taxon: 9606" 

ORIGIN 



Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Qy 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | 




Db 


214 


T C C C CT GT GC CT CCAGACT AT GAAGAT GAGT T T CT C C GCT AT CTGT GGC GT GAT TAT CTG 


273 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | || 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


514 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


573 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


574 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


633 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


634 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


693 



Qy • 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

MINIMI Ml I I I I I I I I I I I I I I | I | | | | | | | | | | | I I I I I I I I I I I I I I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCAT GAACGCTGGGCAGATGACCT CTAT CCCAAGAT CTAC CACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M M I I I I I I I M I I I I I I I I I I I I 
Db 754 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTAC CACAGTTGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | | I I | 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 78 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCT CT GGGGC CGCCAGAT C C C C G GC AC CAC CT CAG C ACT GGT GC GGAACT G GAAG C G C 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II II I I I I II I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I II I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1234 CTCAGTGGC 1242 



RESULT 12 

AF041243 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AF041243 1564 bp mRNA linear PRI 24-FEB-1998 

Homo sapieno 'orexin receptor-1 mRNA, complete cds . 

AF041243 

AF041243.1 GI:2897123 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1564) 

Sakurai,T., Amemiya,A., Ishii,M., Matsuzaki , I . , Chemelli, R.M. , 

Tanaka,H. , Williams , S . C . , Richardson, J. A. , Kozlowski , G . P . , 

Wilson, S., Arch,J.R.S., Buckingham, R. E . , Haynes,A.C, A. Carr,S.A. , 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Annan, R.S., McNulty, D . E . , Liu,W.-S., Terrett, J. A. , 
Elshourbagy, N. A. , Bergsma, D.J. and Yanagisawa, M. 
Orexins and orexin receptors: a family of hypothalamic 
neuropeptides and G protein-coupled receptors that regulate feeding 
behavior 

Cell 92 (4) , 573-585 (1998) 

98150861 

9491897 

2 (bases 1 to 1564) 

Sakurai,T., Amemiya,A. , Ishii,M., Matsuzaki, I . , Chemelli, R. M. , 

Tanaka,H., Williams, S . C. , Richardson, J. A. , Kozlowski, G . P . , 

Wilson, S., Arch,J.R.S., Buckingham, R. E . , Haynes,A.C, A. Carr,S.A., 

Annan, R.S., McNulty, D . E . , Liu,W.-S., Terrett, J. A. , 

Elshourbagy, N. A. , Bergsma, D.J. and Yanagisawa, M. 

Direct Submission 

Submitted ( 07- JAN-1998 ) HHMI /Department of Molecular Genetics, 
University of Texas Southwestern Medical Center at Dallas, 5323 
Harry Hines Blvd., Rm. Y5.224, Dallas, TX 75235-9050, USA 

Location/Qualifiers 

1. .1564 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/chromosome="l" 
7map="lp33" 
154. .1431 

/note="OXlR; G protein-coupled receptor" 
/ codon_start=l 

/product="orexin receptor-1" 
/protein_id-"AAC39601. 1" 
/db_xref="GI:2897124" 

/ trans lation="MEPSATPGAQMGVPPGSREPSPVPPDYEDEFLRYLWRDYLYPKQ 
Y EWVL I AAYVAVFWAL VGNT L VC LAVW RN H HMRT VTN Y F I VN L S LAD VL VT AI C L PA 
SLLVDITESWLFGHALCKVIPYLQAVSVSVAVLTLSFIALDRWYAICHPLLFKSTARR 
ARGSILGIWAVSLAIMVPQAAVMECSSVLPEIANRTRLFSVCDERWADDLYPKIYHSC 
FFIVTYLAPLGLMAMAYFQIFRKLWGRQIPGTTSALVRNWKRPSDQLGDLEQGLSGEP 
QPRGRAFIAEVKQMRARRKTAKMIMVVLLVF7VLCYLPISVLNVLKRVFGMFRQASDRE 
AWACFTFSHWLVYANSAANPIIYNFLSGKFREQFKAAFSCCLPGLGPCGSLKAPSPR 
SSASHKSLSLQSRCSISKIS EHWLT SVTTVL P " 



ORIGIN 



Query Match 97.2%; Score 1079.4; DB 9; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 3e-204; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 



0; 



Db 



1 AT GGAGC C CT C AGC C AC C C C AGGGGC CCAGAT GGGGGTCCCCC CT GGC AGCAGAGAGC C G 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I M 

154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 



QY 



Db 



61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 12 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 



Qy 



Db 



121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
274 T ACCCAAAACAGT AT GAGT G GGT CCT CAT C GCAG C CTAT GT GGCT GT GT T C GT C GT GG C C 333 



181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

334 CT G GT GGG CAACAC GCT GGT CT GC CT GGC C GT GT GG C GGAAC CAC C AC AT GAG GACAGTC 393 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | I I I I I I I I I 
454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

601 CTCTGT CAT GAACGCT GGGCAGATGACCT CTAT CCCAAGAT CTACCACAGTTGCTTCTTT 660 

I I I M I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I 
754 GT CT GT GAT GAAC GCT G GG C AGAT GAC CT CTAT C C C AAGAT CT AC C ACAGTT GCTTCTTT 813 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. 
814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | 
874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

7 81 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | 
934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I i Kl I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 1.14." 
994 CGCGCCTTCCTG G CT GAAGT GAAGC AGAT GC GT GCAC G GAGGAAGACAGCCAAGAT GCTG 1053 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 



I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 108 9 

I I I I I I I I I 

Db 1234 CTCAGTGGC 1242 



RESULT 13 

E43973 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



DNA linear 
(HFGAN72Y) . 



(HFGAN72Y) 



E43973 1133 bp DNA linear PAT 31-JAN-2002 

Novel G protein-coupled receptor 
E43973 

E43973.1 GI:18625172 
JP 2000106888-A/2. 
unidentified 
unidentified 
unclassified. 
1 (bases 1 to 1133) 
Bergsma, D.J. and Ellis, C.E. 
Novel G protein-coupled receptor 
Patent: JP 2000106888-A 2 18-APR-2000; 
SMITHKLINE BEECHAM CORP 
OS Unidentified 
PN JP 2000106888-A/2 
PD 18-APR-2000 
PF 21-JUL-1999 JP 1999206116 
PR 30-APR-1997 US 08/846705 
PI DERK J BERGSMA, CATHARINE ELIZABETH ELLIS 

PC C12N15/09,A61K38/00, A61K38/00, A61K45/00, A61K48/00, A61P1/00, PC 
A61P1/14, 

PC A61P9/02, A61P9/04,A61P9/10, A61P9/12, A61P11/06, A61P13/02, PC 
A61P13/08, 

PC A6lP19/10,A61P25/14,A61P25/16,A6lP25/18,A61P25/22,A61P25/24, 
PC A61P31/04, 

PC A61P31/10,A61P31/12,A61P31/18,A61P33/00,A61P35/00,A6lP37/08, 
PC A61P43/00, 

PC C07K14/705,C07K16/28,C12N1/21,C12N5/10, C12P21/02 , G01N33/566, 
PC G01N33/577// 

PC C12P21/08, (C12N15/09,C12R1:91) , (C12P21/02 , C12R1 : 91 ) , C12N15/00, 
PC A61K37/02, 

PC A61K37/02,C12N5/00, (C12N15/00 r C12R1 : 91) 
CC Strandedness : Single; 
CC Topology : Linear ; 

FH Key Location/Qualifiers 
FT source 1. .1133' 

FT /organism= 1 Unidentified 1 . 

Location/Qualifiers 
1. .1133 

/ organism="unidentif ied" 
/mol_type=" genomic DNA" 
/db xref="taxon: 32644" 



Query Match 97.2%; Score 1078.4; DB 6; Length 1133; 

Best Local Similarity 99.4%; Pred. No. 4.9e-204; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


. 561 


Db 


- 661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 



AT GGAG C C C T CAGC C AC C C CAGGGG C C CAGAT GGGGGTCCCCCCT GGC AGC AGAGAGC C G 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I 

ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I 

TCCCCTGTGCCTC CAGAC TAT GAAGAT GAGTT T CT C C GCT AT CT GT G GC GT GATT AT CT G 120 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

M I I I I I I I M II I I I I I I I I I I I I II I I I I I I I I I II I I I I I I II 

CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 
M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | M I I I I II I 
ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 
I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I 
GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 8 0 

GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 
M I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I 
GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

CT CT GT CAT GAACGCT GGGCAGAT GACCT CTATCCCAAGAT CTACCACAGTT GCTT CTTT 660 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GT C T GT GAT GAAC GCT GGGCAGAT GAC CTCTAT C C CAAGAT C T AC C ACAGT TGCTTCTTT 660 



I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I 

AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 
I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I | I I | 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 



Qy 


841 


CGCGCCTTC CT GGCTGAAGT GAAGCAGAT GC GT GC ACGGAGGAAGAC AG C CAAGAT GCT G 

E 1 1 I 1 1 1 1 1 I 1 i 1 i i i f i i i i i i i i i i i i i i i i i i i i i i i i i i i > i > i i i i i i i f t i i i i 
1 I' M 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 II 1 1 1 1 II II II 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 Mill 

C GC GC CT T CCTGGCT GAAGT GAAGCAGAT GC GT GCACGGAGGAAGACAGC CAAGAT GC T G 


900 


Db 


841 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


960 


Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


960 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 

l i t i i i i i i i i i i i i i i i i i i i i i ] i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 i 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 


Db 


961 


1020 


Qy 


1021 


ACCT T CT C C C ACT GGCT GGT GT ACGC CAAC AGC GCT GC C AAC C C CAT CAT CT ACAACT T C 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 | | | | 


1080 


Db 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCTVACCCCATCATCTACAACTTC 


1080 


Qy 


1081 


CTCAGTGG 1088 
1 1 1 1 1 1 1 1 
CTCAGTGG 1088 




Db 


1081 





RESULT 14 

AX746120 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX746120 1133 bp DNA linear PAT 12-JUN-2003 

Sequence 3 from Patent EP1156110. 

AX746120 

AX746120.1 GI:31744926 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Bergsma,D.J. and Ellis, C.E. 
G-protein coupled receptor (HFGAN72Y) 
Patent: EP 1156110-A 3 21-NOV-2001; 
SMITHKLINE BEECHAM CORPORATION (US) 

Location/ Qualifiers 

1. .1133 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
/note="HGS EST 557082" 



Query Match 97. 2*J Score 1078.4; DB 6; 

Best Local Similarity 99.4%; Pred. No. 4.9e-204; 
Matches 1082; Conservative 0; Mismatches 6; 



Length 1133; 
Indels 0; 



Gaps 



0; 



Qy 
Db 



1 AT GGAG C C CT C AGC CAC C C CAGGGGCC CAGAT GGGGGTCCCCC CT GGC AG CAGAGAGC C G 60 

I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



Qy 

Db 



61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGT T T CT C C GCT AT CT GT GG C GT GATT AT CT G 12 0 



121 T AC C CAAAAC AGT AT GAGT GGGT C CT CAT C GCAG CCT ATGT GGCT GT GTT C GT C GT GGCC 180 

I Ml I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 i 1 1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 i I I 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GT CAT C C CCT AT CT AC AGGCT GT GT C C GT GT CAGT GGCAGT GCT AACT CT CAGCTT CAT C 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I Ml I I M I I II II II II M II II I M I II M II II M I I I II I I I II I I 

541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CT GT CAT GAAC GCT GGGCAGAT GAC CT CT AT C CCAAGATCTAC C ACAGTT GCTTCTTT 660 

II M I I II I I I I I I M I II I II II M II I I I II II II II I II I M I II II II I II II I 

601 GT CT GT GAT GAAC G CT GGGCAGAT GAC CT CT AT C C CAAGAT CTAC C AC AGT T GCTTCTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

II II II II II I I I I I I II I II I I I I I I I I I II M II I I I II II I I I I I I II I II II I II I 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I M I I I II I I I I I I I I I I I I I I M I M I II II I I M I I I I M I M I I I I II I I I I I I I 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I M I II II II II II I I II I II I I I I II I I I II I I I Mi, II I I I I I I I I I I I I I I II I I I 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I 1 I I I 1 I I I t 1 I 1 I I 1 II I I I 1 I I I I I I I I I I t I I I I I I I 1 I I I I I I I I I I 1 I I I I 1 I 1 I 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

II I I II II I I I I I I I I I I I I II I II I I I I I I I II I I II I I I I II I I II I I I I I I I II II 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Db 



I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

1 1 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 

Db 1081 CTCAGTGG 1088 



RESULT 15 

E43972 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



ORIGIN 



DNA linear 
(HFGAN72Y) . 



PAT 31-JAN-2002 



(HFGAN72Y) 



E43972 1170 bp 

Novel G protein-coupled receptor 
E43972 

E43972.1 GI:18625171 
JP 2000106888-A/l. 
unidentified 
unidentified 
unclassified. 
1 (bases 1 to 1170) 
Bergsma, D. J. and Ellis, C.E. 
Novel G protein-coupled receptor 
Patent: JP 2000106888-A 1 18-APR-2000 ; 
SMITHKLINE BEECHAM CORP 
OS Unidentified 
PN JP 2000106888-A/l 
PD 18-APR-2000 
PF 21-JUL-1999 JP 1999206116 
PR 30-APR-1997 US 08/846705 
PI DERK J BERGSMA, CATHARINE ELIZABETH ELLIS 

PC C12N15/09,A6lK38/00,A61K38/00,A61K45/00,A61K48/00,A61Pl/00, PC 
A61P1/14, 

PC A6lP9/02,A61P9/04,A61P9/10,A61P9/12,A61Pll/06,A61P13/02, PC 
A61P13/08, 

PC A61P19/10,A61P25/14,A61P25/16,A61P25/18,A61P25/22,A61P25/24, 
PC A61P31/04, 

PC A61P31/10,A61P31/12,A61P31/18,A61P33/00,A61P35/00,A61P37/08, 
PC A61P43/00, 

PC C07K14/705,C07K16/28,C12Nl/21,C12N5/10,C12P21/02,G01N33/566, 
PC G01N33/577// 

PC C12P21/08, (C12N15/09, C12Rl:91) , (C12P21/02, C12R1 : 91) , C12N15/00, 
PC A61K37/02, 

PC A61K37'/ l 02,C12N5/00, (C12N15/00, C12R1 : 91 ) 
CC Strandedness : Single; 
CC Topology: Linear; 

FH Key Location/Qualifiers 
FT source 1. .1170 

FT /organism= f Unidentified 1 

Location/Qualifiers 
1. .1170 

/organism= "unidentified" 
/mol_type=" genomic DNA" 
/db xref="taxon: 32644" 



Query Match 97.2%; Score 1078.4; DB 6; Length 1170; 

Best Local Similarity 99.4%; Pred. No. 4.9e-204; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
I I I I I I I I I I I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGC CT CCAGACT AT GAAGAT GAGTTT CT C C GCT AT CT GT GGC GT GAT TAT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGT T T CT C C GC T AT CT GT GG C GT GAT T AT CT G 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | I I I I I I 
Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 48 0 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I J I I I I I I I I I 
Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 48 0 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II II I I 
Db 601 GT CT GT GATGAACGCT GGGCAGATGACCT CTAT CCCAAGATCTACCACAGTT GCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 



Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

M I I I U I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

QY 841 CGCGC CTTCCT GGCT GAAGT GAAGCAGAT GCGT GCAC GGAGGAAGACAGC CAAGAT GCTG 900 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | I | | | | | | | | | | 
Db 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I M I I I I I I I I I I I I I I I I I I I I I I || | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 
Db 1081 CTCAGTGG 1088 

Search completed: October 15, 2004, 19:59:37 
Job time : 4500.38 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 
Perfect score: 1110 



October 15, 2004, 13:52:46 ; Search time 474.138 Seconds 

(without alignments) 
9945.416 Million cell updates/sec 

US-10-070-532-3 



Sequence : 



1 atggagccctcagccacccc ttccctggagtctgctctaa 1110 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



6747726 



Post-processing: 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



N_Geneseq_29Jan04 : * * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3 : geneseqn2000s : * 

4: geneseqn2001as : * 

5 : geneseqn2001bs : * 

6: geneseqn2002s : * 

7 : geneseqn2003as : * 

8: geneseqn2003bs:* 

9: geneseqn2003cs : * 

10 : geneseqn2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


1110 


100. 


0 


mo- 


4 


AAS00492 


2 


1102 


99. 


3 


1110 


2 


AAT42827 


3 


1079.4 


97. 


2 


1209 


2 


AAT42826 


4 


1079.4 


97. 


2 


1564 


2 


AAV63468 


5 


1079.4 


97. 


2 


1564 


2 


AAV68514 


6 


1079.4 


97. 


2 


1564 


4 


AAF32103 


7 


1079.4 


97. 


2 


1564 


6 


ABA96021 



Aas004 92 Human neu 
Aat42827 Neuropept 
Aat42826 Neuropept 
Aav63468 cDNA enco 
Aav68514 Nucleotid 
Aaf32103 Human HFG 
Aba96021 HGS EST 5 



8 


1079.4 


97. 


2 


1564 


6 


AAI64173 


Aai64173 


HFGAN72X 


9 


1079.4 


97. 


2 


1564 


6 


AAJ64172 


Aai64172 


Human HFG 


10 


1079.4 


97. 


2 


1564 


7 


ABZ42789 


Abz42789 


Human ore 


11 


1078.4 


97. 


2 


1133 


2 


AAV68512 


Aav68512 


Nucleotid 


12 


1078.4 


97. 


2 


1133 


6 


ABA96020 


Aba96020 


HGS EST 5 


13 


1078.4 


97. 


2 


1170 


2 


AAV68511 


Aav68511 


Nucleotid 


14 


1078.4 


97. 


2 


1170 


6 


ABA96019 


Aba96019 


G-protein 


15 


1077.8 


97. 


1 


1278 


4 


AAS00491 


Aas00491 


Human neu 


16 


1077.8 


97. 


1 


1564 


4 


AAS17464 


Aasl7464 


Human G p 


17 


1076.8 


97. 


0 


1133 


4 


AAS00493 


Aas00493 


Human neu 


18 


1075.2 


96. 


9 


1133 


2 


AAT42828 


Aat42828 


Neuropept 


19 


1074.6 


96. 


8 


1278 


5 


ABI98014 


Abi98014 


Non~endog 


20 


1031.4 


92. 


9 


1278 


4 


AAD09335 


Aad09335 


Cynomolgo 


21 


905 


81. 


5 


1281 


5 


AAC85974 


Aac85974 


Dog orexi 


22 


661 


59. 


5 


789 


2 


AAT13909 


Aatl3909 


Rabbit G- 


23 


661 


59. 


5 


789 


2 


AAT51065 


Aat51065 


G _ Drotei n 


24 


511.8 


46. 


1 


1633 


2 


AAX29700 


Aax29700 


Human 7— t 


25 


511.8 


46. 


1 


1843 


7 


ABZ42790 


Abz42790 


Human ore 


26 


507 


45. 


7 


1335 


4 


AAF90300 


Aaf 90300 


Nucleot"i d 


27 


507 


45. 


7 


1335 


5 


ABI98015 


Abi98015 


Non _ endog 


28 


241.2 


21. 


7 


344 


5 


AAF56748 


Aaf56748 


Human HCR 


29 


241.2 


21. 


7 


9785 


5 


AAF55159 


Aaf 55159 


Mu c 1 pot i H 


30 


241.2 


21. 


7 


10453 


4 


AAS17462 


Aasl7462 


Human G p 


31 


236 


21. 


3 


244 


4 


AAS17449 


Aasl7449 


Human G n 


32 


228.4 


20. 


6 


331 


5 


AAF56750 


Aaf 56750 


Human HCR 


33 


223. 8 


20. 


2 


227 


4 


AAS17453 


Aasl7453 


Human G p 


34 


199 


17. 


9 


356 


4 


AAS17445 


Aasl7445 


Human G p 


35 


198.6 


17. 


9 


452 


5 


AAF56746 


Aaf 56746 


Human HCR 


36 


181 


16. 


3 


263 


5 


AAF56747 


Aaf 56747 


Human HCR 


37 


179 


16. 


1 


179 


4 


AAS17447 


Aasl7447 


Human G o 


38 


161. 6 


14. 


6 


1290 


3 


AAA70507 


Aaa70507 


Novel hirm 

X « V V_- -X- XX CXI L L 


39 


161.6 


14. 


6 


1290 


3 


AAA70508 


Aaa70508 


Novel hum 

li V—* V _1_ XX 14.4.11 


40 


161.6 


14. 


6 


1290 


4 


AAH78518 


Aah78518 


Nucleot i d 


41 


161.6 


14. 


6 


1290 


4 


AAH78519 


Aah78519 


Nucleotid 


42 


161.6 


14. 


6 


1290 


7 


ACC49315 


Acc49315 


Human RFR 


43 


161.6 


14. 


6 


1290 


7 


ACC49316 


Acc49316 


Human RFR 


44 


161.6 


14. 


6 


1290 


9 


ADB76179 


Adb76179 


Altenativ 


45 


161.6 


14. 


6 


1290 


9 


ADB76178 


Adb76178 


Human 0T7 



ALIGNMENTS 



RESULT 1 
AAS00492 

ID AAS00492 standard; cDNA; 1110 BP. 
XX 

AC AAS00492; 
XX 

DT 17-MAY-2001 (first entry) 
XX 

DE Human neuropeptide receptor splice variant 1 cDNA. 
XX 

KW Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

KW nervous system disorder; hyperprolif erative disorder; diabetes mellitus ; 

KW cardiovascular disorder; autoimmune disorder; infectious disorder; 

KW eating behaviour disorder; narcolepsy; neurological disease; 



KW narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 

KW protein co-ordinate data; ss. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 1. .1110 

FT /*tag= a 

FT /product^ "neuropeptide receptor splice variant 1" 

XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001. 
XX 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR 10-SEP-1999; 99US-00393696 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR P-PSDB; AAU00439. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 

PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol. 
XX 

PS Claim 4; Fig 2; 385pp; English. 
XX 

CC The present sequence encodes for human neuropeptide receptor splice 

CC variant 1. Two splice variants (AAU00439-AAU00440 ) and a possible mutant 

CC (AAU00442) of a novel human neuropeptide receptor (AAU00438) are 

CC described. The neuropeptide receptor shows sequence homology to the 

CC neuropeptide Y receptor. Polypeptides and polynucleotides of the 

CC neuropeptide receptor are useful for diagnosing, preventing, or treating 

CC a pathological condition in a subject related to the central nervous and 

CC peripheral nervous systems (CNS and PNS) . The polypeptides and 

CC polynucleotides may be used to treat hyperprolif erative, cardiovascular, 

CC autoimmune, nervous system or infectious disorders e.g. cancer, heart 

CC disease, rheumatoid arthritis, Alzheimer f s disease, HIV infection and 

CC diabetes mellitus . In particular they are useful for preventing, treating 

CC or ameliorating a medical condition in a mammal such as obesity/eating 

CC behaviour disorders, narcolepsy, neurological disease, addiction to 

CC narcotics, nicotine and alcohol, chronic pain, acute pain, migraine 

CC headaches and anxiety disorders. The polynucleotides encoding the 

CC neuropeptide receptor can also be used in gene therapy methods for 

CC treating such diseases 

XX 

SQ Sequence 1110 BP; 194 A; 369 C; 300 G; 247 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1110; DB 4; Length 1110; 

Best Local Similarity 100.0%; Pred. No. 4.6e-253; 

Matches 1110; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I II I I 




Db 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 .1 1 1 1 1 1 1 1 I I I I I I | | | | | 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCT7\ACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I | | | | | 




Db 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


GCAGTCAT GCAAT CCAG C AGT GT GCT GC CT GAGCT AG C CAACCGCACAC GGCT CT T CT CA 


600 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCeAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 


900 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



841 CGC GC CTT C CT GGCT GAAGTGAAGCAGAT GC GT G C AC GGAG GAAGACAGC CAAGAT GCT G 900 



Qy 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 




Db 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 



Qy 



Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I ! I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 



Db 



1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 



Qy 



1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 




Db 



1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 2 
AAT42827 

ID AAT42827 standard; cDNA; 1110 BP. 
XX 

AC AAT42827; 
XX 

DT 22-FEB-1997 (first entry) 
XX 

DE Neuropeptide receptor splice variant-1 gene. 
XX 

KW Human; neuropeptide receptor; splice variant; drug screening; 

KW receptor-agonist; receptor-antagonist; anorectic; antitumour; 

KW anticholes terolemic; neuroprotective ; anticonvulsant ; hypotensive ; 

KW sedative; diagnostic; gene therapy; ss. 

XX 

OS Homo sapiens. 
XX 

PN W09634877-A1. 
XX 

PD 07-NOV-1996. 
XX 

PF 05-MAY-1995; 95WO-US005616 . 
XX 

PR 05-MAY-1995; 95WO-US005616 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 

XX . <,;' ' 

PI Soppet DR, Li Y, Rosen CA; 

XX 

DR WPI; 1996-506094/50. 
DR P-PSDB; AAW06125. 
XX 

PT Human neuro-peptide receptor polypeptide ( s ) - used to identify 

PT antagonists and agonists to such polypeptide ( s ) , e.g. in the treatment of 

PT obesity, Alzheimer's disease, epilepsy, etc. 

XX 

PS Disclosure; Page 50-51; 77pp; English. 
XX 



CC The sequence encodes human neuropeptide receptor splice variant-1, which 

CC retains activity corresponding to the mature receptor (encoded by 

CC AAT42826) . The receptor gene has been isolated from from a human adult 

CC hypothalamus cDNA library, and is structurally related to the G-protein- 

CC coupled receptor family. The receptor may be used in a drug screening 

CC assay for isolation of receptor-agonists and -antagonists, which may be 

CC used as anorectic, antitumour, anticholesterolemic, neuroprotective, 

CC anticonvulsant, hypotensive or sedative drugs, etc. The DNA may also be 

CC used in genetic disease diagnosis or gene therapy. The receptor and its 

CC corresponding antibody may also be used in therapy and diagnosis 
XX 

SQ Sequence 1110 BP; 194 A; 364 C; 305 G; 247 T; 0 U; 0 Other; 



Query Match 99.3%; Score 1102; DB 2; Length 1110; 

Best Local Similarity 99.5%; Pred. No. 3.6e-251; 



Matches 1105; Conservative 0; Mismatches 5; Indels 0; Gaps 


0 


Qy 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 
1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

AT GGAGCC CT C AG CC AC C G CAGGGGC C CAGAT GGGGGT CC C C C CT GGCAGC AGAGAGC C G 


60 


Db 


1 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 
! 1 1 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 I I I I I I I i i i i i i i i i i 

1 1 1 M 1 1 1 1 M M 1 M M 1 I! II 1 M M M | 1 1 1 1 II II 1 1 1 1 M 1 II II 1 II II 

TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 


Db 


61 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
' 1 1 1 1 1 1 1 1 > i i i i i i i i i i i i i i i i i i i i i ii I I l l l l l l l 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Db 


121 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 
• i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i j i 1 i i i i i \ \ 

CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Db 


181 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | | | | 
ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Db 


241 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 
1 1 1 1 M M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 || I 
CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Db 


301 


360 


Qy 

Db 


361 
361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCT7^ACTCTCAGCTTCATC 


420 
420 


Qy 

Db 


421 
421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTAT'i'GTTCAAGAGCACAGCCCGGCGG 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 
480 


Qy 

Db 


481 
481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I | | | | | | | 
GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 
540 


Qy 

Db 


541 
541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 
1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 I 1 1 I I I I I I I 1 1 1 1 I I I I | | | || | | | | | | | | | | | | || | | | | | 
GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 
600 



Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCGCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II II I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I II I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GC AC GGAGGAAGACAGC C AAGAT G CT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GCAC GGAGGAAGACAGC CAAGAT GCTG 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | | I I I I I I 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 3 
AAT42826 

ID AAT42826 standard; cDNA; 1209 BP. 
XX 

AC AAT42826; 
XX 

DT 22-FEB-1997 (first entry) 
XX 



DE Neuropeptide receptor gene. 
XX 

KW Human; neuropeptide receptor; drug screening; receptor-agonist; 

KW receptor-antagonist; anorectic; antitumour ; anticholesterolemic; 

KW neuroprotective; anticonvulsant; hypotensive; sedative; diagnostic; 

KW gene therapy; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT primer_bind complement ( 1 . . 18 ) 

FT " /*tag= a 



FT ( /note= "Binds primer AAT42829" 

FT misc_dif ference 151. .153 
FT 
FT 

FT prime r_bind 
FT 
FT 
XX 

PN W09634877-A1. 
XX 

PD 07-NOV-1996. 
XX 

PF 05-MAY-1995; 
XX 

PR 05-MAY-1995; 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 1996-506094/50. 



DR P-PSDB; AAW06124. 
XX 

PT Human neuro-peptide receptor polypeptide (s ) - used to identify 

PT antagonists and agonists to such polypeptide (s ) , e.g. in the treatment of 

PT obesity, Alzheimer's disease, epilepsy, etc. 

XX 

PS Claim 6; Page 48-49; 77pp; English. 
XX 

CC The sequence encodes a human neuropeptide receptor, and has been mapped 

CC to human chromosome lq31-34. The sequence has been isolated from a human 

CC adult hypothalamus cDNA library, and is structurally related to the G- 

CC protein-coupled receptor family. Splice variants are given in AAT42827- 

CC 28. The sequence may be amplified by PCR with e.g. primers AAT42829-34 

CC for expression in a host cell. The recombinant receptor may be used in a 

CC drug screening assay for isolation of receptor-agonists and -antagonists, 

CC which may be used as anorectic, antitumour, anticholesterolemic, 

CC neuroprotective, anticonvulsant, hypotensive or sedative drugs, etc. The 

CC DNA may also be used in genetic disease diagnosis or gene therapy. The 

CC receptor and its corresponding antibody may also be used in therapy and 

CC diagnosis 
XX 

SQ Sequence 1209 BP; 206 A; 402 C; 330 G; 271 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1079,4; DB 2; Length 1209; 

Best Local Similarity 99.4%; Pred. No . ' 8 . 3e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

M M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | I | | | | | | | | | | | | | | | | | | | | | | | | 

Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

QY 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGT TTCT CC GCT AT CT GT G GC GT GAT T AT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I || | I I I I I I II 

Db 61 TCCCCTGTGCCTC C AGACTAT GAAGAT GAGT T T CT CCGCTAT CT GT GGCGT GAT TAT CT G 120 

QY 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 



/*tag= b 

/codon= seqrCCA, aa:Ala 
complement (1190. . 1192) 
/*tag= c 

/note= "Binds primers AAT42830 and AAT42832" 



95WO-US005616. 
95WO-US005616. 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I 



Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCCCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 | M | | | 1 1 1 1 1 1 1 1 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGr 




Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


S4 0 

•J rt \J 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I | | | | | | | | | | | | | | | | 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACrGrArArGGrTrTTrTrA 

> -' >—*■*■ w >— a -fc vj ^-JJ u ix >J v-*.£ x v_j v_*rj LvJ _L v— ' X VJ V^ X V-* V*« X *»-»il\J v-» X ilvJVyV/uiv/ V_J v/iiViW-/ VJ VJ V»* X v** X X V-* X V-'.iV 


\j \J kj 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


GT CT GT CAT GAACGCT GGGC AGAT GAC CT CT AT C C C AAG AT CT A C C AC A RTT G.C T T r T T T 

v — ' v** ^ vj ^ v^j vj.* kv-* \j x— ' J- vj vj vj V/nUii x vj.*lv_*' X v_* X J L x v*» v^ V^iuiVj/i x v-» X nVu> ^iW^iivJ X X \J v> X X XXX 


U D VJ 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I I I I I I 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGA^CCCCAGCCCCGGGGC 


840 


Qy 


841 


CGCGCCTTCCT GGCTGAAGT GAAGCAGAT GCGTGCACGGAGGAAGACAGCCAAGAT GCTG 


900 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


C GCGCCT T C CT GGCTGAAGT GAAGCAGAT GC GT GC AC GGAGGAAGACAGC CAAGAT GCT G 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


960 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 



1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1081 CTCAGTGGC 108 9 

RESULT 4 
AAV63468 

ID AAV63468 standard; cDNA; 1564 BP. 
XX 

AC AAV63468; 
XX 

DT 26-JAN-1999 (first entry) 
XX 

DE cDNA encoding G-protein coupled receptor (HFGAN72X) polypeptide. 
XX 

KW G-protein coupled receptor; HFGAN72X; HIV infection; anorexia; cancer; 

KW bulimia; asthma; Parkinson's disease; acute heart failure; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW benign prostatic hypertrophy; neurological disorder; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 154. .1431 

FT /*tag= a 

FT /product^ "HFGAN72X" 

XX 

PN EP875566-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997; 97EP-00308563 . 
XX 

PR 30-APR-1997; 97US-00846704 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 1998-55^432/48. . 

DR P-PSDB; AAW80456. 

XX 

PT New human G-protein coupled receptor HFGAN72X polypeptide and 

PT polynucleotide - useful as diagnostic reagents and for treating e.g. HIV 

PT infection, cancer and Parkinson's disease. 

XX 

PS Claim 3; Page 7; 24pp; English. 
XX 

CC The present sequence encodes a G-protein coupled receptor (HFGAN72X) 
CC polypeptide. HFGAN72X polypeptides and polynucleotides are useful for 
CC diagnosing diseases related to over or under expression of HFGAN72X 



CC proteins by identifying mutations in the HFGAN72X gene using HFGAN72X 

CC probes, or determining HFGAN72X protein or mRNA expression levels. 

CC HFGAN72X polypeptides are also useful for screening for compounds which 

CC affect activity of the protein. Diseases that can be treated with 

CC HFGAN72X include HIV infections, pain, anorexia, cancers, bulimia, 

CC asthma, Parkinson's disease, acute heart failure, hypotension, 

CC hypertension, urinary retention, osteoporosis, angina pectoris, 

CC myocardial infarction, ulcers, allergies, benign prostatic hypertrophy, 

CC and psychotic and neurological disorders 

XX 

SQ Sequence 1564 BP; 271 A; 511 C; 435 G; 347 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1079.4; DB 2; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 274' TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | 

Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 AC CAACT AC T T CAT T GT CAAC CT GT CCCT GGCT GAC GTT CT G GT GACT GCT AT CT GC CT G 300 

I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | M 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 



Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

M I II I I I I I I II I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 



Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I M II I I I I II I I I I I I I II I I ii I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 



Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I M I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCT GT CAT GAAC GCT G GGCAGAT GAC CT CT AT C C CAAGAT CT ACCACAGT TGCTTCTTT 660 



I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i i 



Db 


754 


GT CT GT GAT GAAC G C T G GGCAGAT GACCT CT AT C C C AAGAT CT AC C AC AGT TGCTTCTTT 


813 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


814 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


873 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I 1 1 1 I I I I I I I I | | | M 1 1 1 1 1 1 1 1 1 1 




Db 


874 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


933 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I M | | | | | | | | | 




Db 


934 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


993 


Qy 


841 


CGCGCCTTC CT GGCT GAAGT GAAG C AGAT GC GT GC AC GGAG GAAGACAGC C AAGAT GCT G 


900 






1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


994 


CGCGCCT T C CT GGCT GAAGT GAAGCAGAT GCGT GCAC G GAGGAAGACAGCCAAGAT GCT G 


1053 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1054 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


1113 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


1114 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1173 


Qv 


1021 


ACCTTCTCCCACTGGCTGGTGT ACGCCA AC AQCftCT^rr A APrrr ATP atttat a APTTr 


lUoU 






1 1 M 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1174 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1233 


Qy 


1081 


CTCAGTGGC 1089 




Db 


1234 


1 1 1 1 1 1 1 1 1 

CTCAGTGGC 1242 





RESULT 5 
AAV68514 

ID AAV68514 standard; cDNA; 1564 BP. 
XX 

AC AAV68514; 
XX 

DT 29-JAN-1999 (first entry) 
XX 

DE Nucleotide sequence of a probe HGS EST 554692. 

XX - ^ 1 • ' 

KW Probe HGS EST 554 692; G-protein coupled receptor family; HFGAN72Y; 



KW mutation; probe; agonist; antagonist; activation; inhibition; 

KW gene therapy; antibody; immune response; vaccine; HIV-1; HIV-2; cancer; 

KW anorexia; bulimia; asthma; Parkinson f s disease; acute heart failure; 

KW hypotension; hypertension; urinary retention; osteoporosis; 

KW angina pectoris; myocardial infarction; ulcer; allergies; 

KW psychotic disorder; neurological disorder; gene mapping; ss. 
XX 

OS Synthetic. 

OS Homo sapiens. 
XX 



PN EP875565-A2. 
XX 

PD 04-NOV-1998. 

XX • 

PF 27-OCT-1997; 97EP-00308554 . 
XX 

PR 30-APR-1997; 97US-00846705 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis C; 
XX 

DR WPI; 1998-570286/49. 
XX 

PT New G-protein coupled receptor HFGAN72Y polypeptide and polynucleotide - 

PT useful as diagnostic reagents and for prevention and treatment of HIV 

PT infections, cancer, osteoporosis and Parkinson f s disease. 
XX 

PS Example 1; Page 19-20; 22pp; English. 
XX 

CC This is the nucleotide sequence of the probe HGS EST 554 692 used in the 

CC method of the invention involving the G-protein coupled receptor, 

CC HFGAN72Y. Its polypeptides and polynucleotides are useful for diagnosing 

CC susceptibility to diseases by detecting mutations in the HFGAN72Y gene 

CC using probes containing the HFGAN72Y nucleotide sequence, and can 

CC diagnose diseases associated with HFGAN72Y imbalance by determining 

CC HFGAN72Y polypeptide or mRNA expression levels. Agonists/antagonists can 

CC be used in treatment to activate/inhibit HFGAN72Y activity, in addition 

CC to direct administration of antisense sequences to prevent expression, or 

CC HFGAN72Y polypeptides to treat conditions associated with a lack HFGAN72Y 

CC protein. Gene therapy may also be used to affect endogenous HFGAN72Y 

CC polypeptide production. HFGAN72Y antibodies are useful for inducing an 

CC immune response to immunise and prevent diseases, and for isolating 

CC HFGAN72Y clones or purifying the polypeptides by affinity chromatography. 

CC HFGAN72Y polypeptides can be administered directly or as a vaccine to 

CC inoculate against diseases. Diseases diagnosed, prevented or treated 

CC include HIV-1 or HIV-2 infections, pain, cancers, anorexia, bulimia, 

CC asthma, Parkinson's disease, acute heart failure, hypotension, 

CC hypertension, urinary retention, osteoporosis, angina pectoris, 

CC myocardial infarction, ulcers; allergies, benign prostatic hypertrophy, 

CC and psychotic and neurological disorders. The HFGAN72Y polypeptide is 

CC also useful for mapping the gene to a chromosome, allowing gene 

CC inheritance to be studied through linkage analysis 

XX 

SQ Sequence 1564 BP; 269 A; 508 C; 436 G; 347 T; 0 U; 4 Other; 

Query Match 97.2%; Score 1079.4; DB 2; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I M I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I | | | | | 
Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTC C AGACTAT GAAGAT GAGT TT CT C C GCT AT C T GT GGCGT GATT AT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 



Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I ! I I I I I I 

Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

QY 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || 
Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I M | | 

Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTGAAGAGCACAGCCCGGCGG 633 

Qy 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

MINIMI Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CT CT GT CATGAAC GCT GGGC AGAT GAC CT CTAT C CCAAGAT CT AC C ACAGTTGCT T CT T T 660 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | M I I I I I I 
Db 754 GT CT GTGAT GAAC GCT GG GCAGAT GAC CT CT AT C CCAAGAT CT AC CACAGTTGCT T CT TT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGC GCCTTCCTGGCTGAAGTGAAGCAGAT GCGT GCACGGAGGAAGACAGC CAAGAT GCT G 900 

M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

MM I II I II I I II I I III I I III I I M | | | | | M | | | | || | | | | | | | 

Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 



Qy 

Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 



Qy 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGrGCTGCCAACCCrATCATrTAf'AArTTr 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 M 1 1 1 1 M | M | | M | | M II 1 1 1 I 1 1 1 1 1 1 1 II 1 ! 1 1 1 1 1 1 [ 1 




Db 


1174 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1233 


Qy 


1081 


CTCAGTGGC 108 9 








1 1 1 1 1 1 II 1 




Db 


1234 


CTCAGTGGC 1242 




RESULT 6 








AAF32103 








ID AAF32103 


standard; cDNA; 1564 BP. 





XX 

AC AAF32103; 
XX 

DT 10-APR-2001 (first entry) 
XX 

DE Human HFGAN72 receptor coding sequence SEQ ID NO: 12. 
XX 

KW Human; mouse; rat; Lig72A; Lig72B; neuropeptide receptor; HFGAN72; . 

KW truncation mutant; ligand; neurodegenerative disorder; pain; 

KW eating disorder; behaviour disorder; mood disorder; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200100787-A2. 
XX 

PD 04-JAN-2001. 
XX 

PF 22-JUN-2000; 2000WO-US017251 . 
XX 

PR 25-JUN-1999; 99US-0141156P . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Bingham S, Darker J, Liu W, Martin JD, Parsons AA, Patel SR; 
XX 

DR WPI; 2001-071483/08. 
XX 

PT Polynucleotides encoding Lig 72A polypeptides or their variants, which 

PT are useful in the treatment of if disease or disorder associated with 

PT pain, e.g. enhanced or exaggerated sensitivity to pain, hyperalgesia, 

PT neuropathic pain and back pain. 
XX 

PS Disclosure; Fig 6; lOlpp; English. 
XX 

CC The present invention provides the protein and coding sequences for the 

CC human, mouse and rat HFGAN receptor ligand Lig72A. It also provides 

CC truncated mutant versions. These, and their agonists and antagonists, are 

CC all useful in the treatment of eating, neurodegenerative, behaviour, 

CC mood, sexual, hormonal and sleep disorders, pain, depression, epilepsy 

CC and acute inflammatory conditions 



XX 

SQ Sequence 1564 BP; 271 A; 511 C; 435 G; 347 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1079.4; DB 4; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 T C CCCTGT GCCT C CAGACT AT GAAGAT GAGT T T C T C C GCTAT CT GT GGC GT GAT TAT CT G 120 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTGCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 G C AGT CAT GCAATCCAGCAGT GT GCTGC CT GAGCTAGCCAAC C GCAC AC GGCT CT T CT CA 600 

I I I I I I I I I III I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 694 GCAGT CAT GGAAT GCAGCAGT GTG CT GCCT GAGCTAGCCAAC C G CACAC GGC : i CT T CT CA 753 

Qy 601 CTCT GT CATGAACGCTGGGCAGATGACCTCTATCCCAAGAT CTACCACAGTT GCTT CTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | M | | | | | | | | | | | 
Db 754 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I II I I I I I I I I I I I I I I Mill I II I I I I I I I I 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I II I I I I I I I I II II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 



Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I M I I I I I II I I I I I I I I I !H I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I | | I I I I I I I I I 
Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I II I I 

Db 1234 CTCAGTGGC 1242 



RESULT 7 
ABA96021 



ID ABA96021 standard; cDNA; 1564 BP. 
XX 

AC ABA96021; 
XX 

DT 12-MAR-2002 (first entry) 
XX 

DE HGS EST 554692. 
XX 

' KW G-protein; receptor; HFGAN72Y; cytostatic; cardiant; analgesic; cancer; 

KW nootropic; tranquillising; neuroprotective; anti-asthmatic; gene therapy; 

KW infection; HIV-1; pain; anorexia; bulimia; Parkinson's disease; ulcer; 

KW cardiac disease; urinary retention; asthma; allergy; psychotic disorder; 

KW benign prostatic hypertrophy; neurological disorder; anxiety; delirium; 

KW schizophrenia; manic depression; dementia; mental retardation; EST; 

KW dyskinesia; Huntington's disease; Tourette 1 s syndrome; HIV-2; 

KW HGS EST 554692; expressed sequence tag; probe; ss. 
XX 

OS Homo sapiens . 



XX 

PN EP1156110-A2. 
XX 

PD 21-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203010 . 
XX 

PR 30-APR-1997; 97US-00846705 . 

PR 27-OCT-1997; 97EP-00308554 . 
XX 



PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-084320/12. 
XX 

PT New polynucleotide encoding a G-protein coupled receptor designated 

PT HFGAN72Y is useful to diagnose and treat associated diseases including 

PT cancer, infection, cardiac disease and psychotic and neurological 

PT disorders. 
XX 

PS Example 1; Page 19-20; 22pp; English. 
XX 

CC The sequence represents HGS EST 554692. The sequence was used in the 

CC invention as a probe to screen a human genomic placenta phage library. 

CC The invention relates to a novel isolated polynucleotide encoding 

CC HFGAN72Y polypeptide. The polypeptide of the invention has cytostatic, 

CC cardiant, analgesic, tranquillising, nootropic, neuroprotective, and anti 

CC -asthmatic activity. The HFGAN72Y has a use in gene therapy. The HFGAN72Y 

CC polynucleotide or an HFGAN72Y polypeptide agonist are used to treat a 

CC subject in need of enhanced HFGAN72Y activity or expression. An HFGAN72Y 

CC antagonist or competitor, or nucleic acid which inhibits HFGAN72Y 

CC expression is used to treat a subject in need of decreased HFGAN72Y 

CC activity or expression. HFGAN72Y-associated diseases include infections, 

CC particularly by HIV-1 or HIV-2, pain, anorexia, bulimia, Parkinson's 

CC disease, cardiac diseases, cancers, ulcers, urinary retention, asthma, 

CC allergies, benign prostatic hypertrophy, and psychotic and neurological 

CC disorders including anxiety, schizophrenia, manic depression, delirium, 

CC dementia, severe mental retardation and dyskinesias such as Huntington's 

CC disease and Tourette 1 s syndrome 

XX 

SQ Sequence 1564 BP; 269 A; 508 C; 436 G; 347 T; 0 U; 4 Other; 



Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGTT T CT CCGCTAT CT GT GGCGT GAT TAT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy "r 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 334 CT GGT GGG CAACAC GCT GGT CT GCCT GGC C GT GT GGCGGAAC CAC CAC AT GAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 



Qy 


301 


Db 


454 


Qy 


361 


Db 


514 


Qy 


421 


Db 


574 


Qy 


481 


Db 


634 


Qy 


541 


Db 


694 


Qy 


601 


Db 


754 


Qy 


661 


Db 


814 


Qy 


721 


Db 


874 


Qy 


781 


Db 


934 


Qy 


841 


Db 


994 


Qy 


901 


Db 


1054 


Qy 


961 


Db 


1114 


Qy 


1021 


nU 
UD 


11/4 


Qy 


1081 


Db 


1234 



CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I M I I II I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | M | | | | 
CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I II I I I I I I II I I 
GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I | | | | I I I I I I I I I I I I I I I I I 

GC C CT G GAC CGCT GGT AT GC C AT CT GC CAC C CACT ATT GT T CAAGAG CAC AGC C C GGCGG 633 

GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | M I I I I I I I I I I I I I I I I I 

GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 
I I I I I I I I I III I I I I I I I II I I | | | | | | | | | | | M I I I II I I I I I | | | || | | | | || | 
GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

CT CT GT CAT GAACGCTGGGCAGAT GACCT CT AT CCCAAGAT CTACCACAGTTGCTT CTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

GT CT GT GAT GAACGC T GGGCAGAT GAC CT CTAT CCCAAGAT CT AC CAC AGT T GCTT CT T T 813 

ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 
I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | I | | 
ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I Mill 

CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I II I II I Ill I I I I I I I I I I I I I I I I I I 

ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I M I I II I I I I I I I INI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I II I I I 

ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

CTCAGTGGC 1089 
I I I I I I I I I 
CTCAGTGGC 1242 



RESULT 8 
AAI64173 ' 



ID AAI64173 standard; cDNA;., 1564 BP. 
XX 

AC AAI64173; 
XX 

DT 22-JAN-2002 (first entry) 
XX 

DE HFGAN72X G coupled receptor polypeptide partial sequence. 
XX 

KW Antibacterial; fungicide; virucide; protozoacide; anti-HIV; analgesic; 

KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer; 

KW antiasthmatic; tranquiliser ; neuroleptic; antidepressant; anticonvulsant; 

KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia; 

KW Parkinson's disease; acute heart failure; hypotension; hypertension; 

KW urinary retention; osteoporosis; angina pectoris; probe; 

KW myocardial infarction; ulcers; asthma; allergy; delirium; dementia; 

KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression; 

KW dyskinesia; G coupled receptor; HFGAN72X; 7 transmembrane receptor; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 154. .1362 

FT /*tag= a 

FT /partial 

FT /product^ "HFGAN72X protein" 

FT /note= "The specification states that this is a partial 

FT sequence even though it contains start and stop codons; 

FT HFGAN72X is a G coupled receptor polypeptide" 

FT /transl_except= (pos:991. .993, aa:Ala) 

XX 

PN EP1154019-A2. 
XX 

PD 14-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203008 . 
XX 

PR 30-APR-1997; 97US-00846704 . 
PR 27-OCT-1997; 97EP-00308563 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-012659/02. 
DR P-PSDB; AAG78346. 
XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and 
PT treatment of e.g. infections, cancer, anorexia, bulimia, Parkinson's 
PT disease, and acute heart failure. 
XX 

PS Example 3; Page 9; 2 4pp; English. 

_XX 

CC The present sequence is that of a known partial nucleotide sequence 
CC encoding a HFGAN72X polypeptide (AAG78346) used as a probe to identify 
CC the HFGAN72X gene (AAI64173) . The specification describes a newly 



CC isolated polynucleotide encoding a human HFGAN72X G coupled receptor 

CC polypeptide. The protein of the invention has antibacterial, fungicide, 

CC virucide, protozoacide, anti-HIV, cardiant, analgesic, cytostatic, 

CC nootropic, antiparkinsonian, antiulcer, antiasthmatic, tranquiliser, 

CC neuroleptic, antidepressant, anticonvulsant and osteopathic activities. 

CC HFGAN72X polynucleotides (PNs) are used to express HFGAN72X in vivo, to 

CC treat diseases requiring increased activity or expression of HFGAN72X; 

CC for recombinant production of HFGAN72X; diagnose diseases by detecting 

CC mutations in genomic sequences and in chromosome identification and 

CC mapping. HFGAN72X polypeptides are used to raise specific antibodies; as 

CC therapeutic agents; to identify HFGAN72X protein-expressing clones; to 

CC purify HFGAN72X proteins; in vaccines. Cells transformed with HFGAN72X 

CC PNs are used to identify (ant ) agonists of HFGAN72X, useful 

CC therapeutically. Nucleic acids that inhibit expression of HFGAN72X and 

CC polypeptides that compete with ligands for binding to HFGAN72X proteins 

CC are also useful therapeutically and diagnostically . HFGAN72X-related 

CC diseases include infections (bacterial, viral, fungal or protozoal, 

CC particularly HIV-1 or -2); pain; cancer; anorexia; bulimia; Parkinson ! s 

CC disease; acute heart failure; hypotension; hypertension; urinary 

CC retention; osteoporosis; angina pectoris; myocardial infarction; ulcers; 

CC asthma; allergy; benign prostatic hypertrophy; anxiety; schizophrenia; 

CC manic depression; delirium; dementia; severe mental retardation and 

CC dyskinesias 

XX 

SQ Sequence 1564 BP; 269 A; 508 C; 436 G; 347 T; 0 U; 4 Other; 



Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 



Qy 


l 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qy 


61 


TCCCCTGTGCCTC C AGACT AT GAAGAT GAGT T T CTCCGCT AT CT GT GGC GT GAT TAT CT G 


120 






1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


214 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


273 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I M | | | | | | | | | | | | | | | | | | M | | | | || || | | | 




Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1.1 1 I I I I I I | | | 




Db 


334 


CT GGT GGG CAACAC GCT GGTCTGCCTGGCCGTGTGGCG GAAC CACC AC AT GAG GAC AGT C 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I | | II | | | | | || | 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


514 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


573 



Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I 

Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GC AGT CAT GCAAT C C AGC AGT GT GCT GC CT GAGCT AG C C AAC C G C AC AC GGCT CT T CT CA 600 

I I I I I II I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CT CT GT CAT GAAC GCT GG GCAGAT GACCT CT AT C C CAAGAT CT AC CAC AGT TGCTTCTTT 660 

Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 754 GT CT GT GAT GAAC G C T GGGCAGAT GACCT CT AT CC CAAGAT CTAC C AC AGTT GCTT CT TT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 87 4 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 AC CT T CT C C C ACT G G CT GGT GT AC G C CAAC AG C GCT G C CAAC C C CAT CAT CTAC AACTT C 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 117 4 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

MINIMI 

Db 1234 CTCAGTGGC 1242 



RESULT 9 
AAI64172 

ID AAI64172 standard; cDNA; 1564 BP. 
XX 

AC, AAI64172; 
XX 

DT 22-JAN-2002 (first entry) 



XX 

DE Human HFGAN72X G coupled receptor polypeptide cDNA. 
XX 

KW Antibacterial; fungicide; virucide; protozoacide; anti-HIV; analgesic; 

KW cytostatic; nootropic; antiparkinsonian; cardiant; antiulcer; 

KW antiasthmatic; tranquiliser; neuroleptic; antidepressant; anticonvulsant; 

KW osteopathic; HIV infection; pain; cancer; anorexia; bulimia; 

KW Parkinson f s disease; acute heart failure; hypotension; hypertension; 

KW urinary retention; osteoporosis; angina pectoris; myocardial infarction; 

KW ulcers; asthma; allergy; delirium; dementia; 

KW benign prostatic hypertrophy; anxiety; schizophrenia; manic depression; 

KW dyskinesia; G coupled receptor; HFGAN72X; 7 transmembrane receptor; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 154. .1431 

FT /*tag= a 

FT /product= "HFG7\N72X protein" 

FT /note= "G coupled receptor polypeptide" 

XX 

PN EP1154019-A2. 
XX 

PD 14-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203008 . 
XX 

PR 30-APR-1997; 97US-00846704 . 

PR 27-OCT-1997; 97EP-00308563 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 

XX 

DR WPI; 2002-012659/02. 

DR P-PSDB; AAG78345. 
XX 

PT Nucleic acid encoding the HFGAN72X receptor, useful for diagnosis and 

PT treatment of e.g. infections, cancer, anorexia, bulimia, Parkinson's 

PT disease, and acute heart failure. 
XX 

PS Claim 3; Page 7; 24pp; English. 
XX 

CC The present sequence is that of a cDNA encoding a HFGAN72X polypeptide 

CC (AAG78345) . The specification describes a newly isolated polynucleotide 

CC encoding a HFoAN72X G coupled receptor polypeptide. The protein, or the 

CC invention has antibacterial, fungicide, virucide, protozoacide, anti-HIV, 

CC cardiant, analgesic, cytostatic, nootropic, antiparkinsonian, antiulcer, 

CC antiasthmatic, tranquiliser, neuroleptic, antidepressant, anticonvulsant 

CC and osteopathic activities. HFGAN72X polynucleotides (PNs) are used to 
CC . express HFGAN72X in vivo, to treat diseases requiring increased activity 

CC or expression of HFGAN72X; for recombinant production of HFGAN72X; 

CC diagnose diseases (or susceptibility to them) by detecting mutations in 

CC genomic sequences and in chromosome identification and mapping. HFGAN72X 

CC polypeptides are used to raise specific antibodies; as therapeutic agents 

CC ; to identify HFGAN72X protein-expressing clones; to purify HFGAN72X 

CC proteins; in vaccines. Cells transformed with HFGAN72X PNs are used to 



CC identify (ant ) agonists of HFGAN72X, useful therapeutically. Nucleic acids 

CC that inhibit expression of HFGAN72X and polypeptides that compete with < 

CC ligands for binding to HFGAN72X proteins are also useful therapeutically 

CC and diagnostically . HFGAN72X-related diseases include infections 

CC (bacterial, viral, fungal or protozoal, particularly HIV-1 or -2); pain; 

CC cancer; anorexia; bulimia; Parkinson's disease; acute heart failure; 

CC hypotension; hypertension; urinary retention; osteoporosis; angina 

CC pectoris; myocardial infarction; ulcers; asthma; allergy; benign 

CC prostatic hypertrophy; anxiety; schizophrenia; manic depression; delirium 

CC ; dementia; severe mental retardation and dyskinesias 

XX 

SQ Sequence 1564 BP; 271 A; 511 C; 435 G; 347 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1079.4; DB 6; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 



Qy 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 !! I 1 I 1 

1 1 1 1 1 1 1 jiii i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i \ | \ | | | | | | | [ j | \ i \ 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 I I I I I I I | 




Db 


214 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


273 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 | | | | | | | | | | 1 1 1 1 1 1 1 1 




Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | || | 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Db 


454 


1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 | | 




Db 


514 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


573 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCl'^CCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 




Db 


574 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


633 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


634 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


693 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


694 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


753 



Qy 601 CT CT GTCAT GAAC GCT GGGCAGAT GAC C T CT AT C C CAAGAT CT AC C ACAGT TGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 754 GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CTAT C C CAAGAT CT AC CACAGT TGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGC GC CTT C CT GGCT GAAGT GAAGCAGAT GC GT GCACGGAGGAAGACAGC CAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I II I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1234 CTCAGTGGC 1242 



RESULT 10 
ABZ42789 



ID ABZ42789 standard; DNA; 1564 BP. 
XX 

AC ABZ42789; 
XX 

DT 04-MAR-2003 (first entry) 
XX 

DE HumaK orexin receptor 1 nucleotide SEQ ID NO: 367 . ., 
XX 

KW G protein-coupled receptor; GPCR; antigenic peptide; gene therapy; 

KW G protein-coupled receptor modulator; antibody; immune- related disease; 

KW growth-related disease; cell regeneration-related disease; AIDS; cancer; 

KW immunological-related cell proliferative disease; autoimmune disease; 

KW Alzheimer's disease; atherosclerosis; infection; osteoarthritis; allergy; 

KW osteoporosis; cardiomyopathy; inflammation; Crohn's disease; diabetes; 

KW graft versus host disease; Parkinson's disease; multiple sclerosis; pain; 

KW psoriasis; anxiety; depression; schizophrenia; dementia; memory loss; 

KW mental retardation; epilepsy; asthma; tuberculosis; obesity; nausea; 

KW hypertension; hypotension; renal disorder; rheumatoid arthritis; trauma; 



KW ulcer; gene; ds . 
XX 

OS Homo sapiens. 
XX 

PN WO200261087-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 19-DEC-2001; 2001WO-US050107 . 
XX 

PR 19-DEC-2000; 2000US-0257144P . 
XX 

PA (LIFE-) LIFESPAN BIOSCIENCES INC. 
XX 

PI Burmer GC, Roush CL, Brown JP; 
XX 

DR WPI; 2003-046718/04. 

DR P-PSDB; ABP81941. 
XX 

PT New isolated antigenic peptides e.g., for G protein-coupled receptors 

PT (GPCR) , useful for diagnosing and designing drugs for treating conditions 

PT in which GPCRs are involved, e.g. AIDS, Alzheimer f s disease, cancer or 

PT autoimmune diseases . 

XX 

PS Disclosure; Fig 1; 523pp; English. 
XX 

CC The present invention describes antigenic peptides (I) comprising: (a) 

CC any one of 1601 sequences (see ABP82019 to ABP8.3619) of 12-24 amino 

CC acids. Also described: (1) an assay for the detection of a particular G 

CC protein-coupled receptor (GPCR) or a candidate polypeptide in a sample; 

CC and (2) an isolated antibody having high specificity and high affinity or 

CC avidity for a particular GPCR. (I) can be used as GPCR modulators and in 

CC gene therapy. The antigenic peptides for GPCRs are useful in detecting an 

CC antibody against a particular GPCR, and in the production of specific 

CC antibodies. The peptides and antibodies are also useful for detecting the 

CC presence or absence of corresponding GPCRs. The antigenic peptides for 

CC GPCRs and antibodies are useful for diagnosing and designing drugs for 

CC treating immune- related diseases, growth-related diseases, cell 

CC regeneration-related disease, immunological-related cell proliferative 

CC diseases, or autoimmune diseases, e.g. AIDS, Alzheimer f s disease, 

CC atherosclerosis, bacterial, fungal, protozoan or viral infections, 

CC osteoarthritis, osteoporosis, cancer, cardiomyopathy, chronic and acute 

CC inflammation, allergies, Crohn's disease, diabetes, graft versus host 

CC disease, Parkinson's disease, multiple sclerosis, pain, psoriasis, 

CC anxiety, depression, schizophrenia, dementia, mental retardation, memory 

CC loss, epilepsy, asthma, tuberculosis, obesity, nausea, hypertension, 

CC hypotension, renal disorders, rheumatoid arthritis, trauma, ulcers, or 

CC any other disorder in which GPCRs are involved. The antibodies may be 

CC used in immunoassays and iirariunodi a gnosis . ABZ42523 to ABZ42869 encode 

CC GPCR proteins given in ABP81675 to ABP82018, which are used in the 

CC exemplification of the present invention 

XX 

SQ Sequence 1564 BP; 268 A; 513 C; 436 G; 347 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1079.4; DB 7; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.9e-246; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 



0 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGT T T CT CCGCT AT CT GT GGC GT GAT TAT C T G 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I 
214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

121 T AC C C AAAAC AGT AT GAGT G GGT C CT C AT C GC AGC CT AT GT GGCT GT GT T C GT C GT GGC C 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I II I I I I I 

274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I 
454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

'481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I II I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

601 CT CTGT CAT GAACGCT GGGCAGAT GACCTCT AT C C CAAGATCT ACCAC AGT TGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
754 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 813 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCAToGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I I I I I I I I I 

874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 



Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I l -l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 994 CGCGCCTT C CT G GCT GAAGT GAAGC AGAT G C GT GCAC G GAGGAAGACAGC CAAGAT GCT G 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1174 AC C TT CT C C C ACT G GCT GGTGT AC GC CAACAG CGCT GC CAAC C C CAT CAT CT ACAACT T C 1233 

Qy 1081 CTCAGTGGC 10 8 9 

MINIMI 
Db 1234 CTCAGTGGC 1242 

RESULT 11 
AAV68512 

ID AAV68512 standard; cDNA; 1133 BP. 
XX 

AC AAV68512; 
XX 

DT 29-JAN-1999 (first entry) 
XX 

DE Nucleotide sequence of HGS EST 557082. 
XX 



KW HGS EST 557082; G-protein coupled receptor family; HFGAN72Y; mutation; 

KW probe; agonist; antagonist; activation; inhibition; gene therapy; 

KW antibody; immune response; vaccine; HIV-1; HIV-2; cancer; anorexia; 

KW bulimia; asthma; Parkinson f s disease; acute heart failure; hypotension; 

KW hypertension; urinary retention; osteoporosis; angina pectoris; 

KW myocardial infarction; ulcer; allergies; psychotic disorder; 

KW neurological disorder; gene mapping; ss. 

XX 

OS Homo sapiens. 
XX 

PN EP875565-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997;^ 97EP-00308554 . ... - vr 



XX 

PR 30-APR-1997; 97US-008 46705 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis C; 
XX 

DR WPI; 1998-570286/49. 
XX 

PT New G-protein coupled receptor HFGAN72Y polypeptide and polynucleotide - 
PT useful as diagnostic reagents and for prevention and treatment of HIV 



PT infections, cancer, osteoporosis and Parkinson's disease-. 
XX 

PS Example 1; Page 18-19; 22pp; English. 
XX 

CC This is the nucleotide sequence of the HGS EST 557082 used in the method 

CC of the invention involving the G-protein coupled . receptor , HFGAN72Y. Its 

CC polypeptides and polynucleotides are useful for diagnosing susceptibility 

CC to diseases by detecting mutations in the HFGAN72Y gene using probes 

CC containing the HFGAN72Y nucleotide sequence, and can diagnose diseases 

CC associated with HFGAN72Y imbalance by determining HFGAN72Y polypeptide or 

CC mRNA expression levels. Agonists/antagonists can be used in treatment to 

CC activate/inhibit HFGAN72Y activity, in addition to direct administration 

CC of antisense sequences to prevent expression, or HFGAN72Y polypeptides to 

CC treat conditions associated with a lack HFGAN72Y protein. Gene therapy 

CC may also be used to affect endogenous HFGAN72Y polypeptide production. 

CC HFGAN72Y antibodies are useful for inducing an immune response to 

CC immunise and prevent diseases, and for isolating HFGAN72Y clones or 

CC purifying the polypeptides by affinity chromatography. HFGAN72Y 

CC polypeptides can be administered directly or as a vaccine to inoculate 

CC against diseases. Diseases diagnosed, prevented or treated include HIV-1 

CC or HIV-2 infections, pain, cancers, anorexia, bulimia, asthma, 

CC Parkinson's disease, acute heart failure, hypotension, hypertension, 

CC urinary retention, osteoporosis, angina pectoris, myocardial infarction, 

CC ulcers; allergies, benign prostatic hypertrophy, and psychotic and 

CC neurological disorders. The HFGAN72Y polypeptide is also useful for 

CC mapping the gene to a chromosome, allowing gene inheritance to be studied 

CC through linkage analysis 

XX 

SQ Sequence 1133 BP; 202 A; 366 C; 314 G; 251 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1078.4; DB 2; Length 1133; 

Best Local Similarity 99.4%; Pred. No. 1.4e-245; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGTTT CT CC GCT ATCT GT GG C GT GATTAT CT G 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGT T T CT CC GCT AT CTGT G GC GT GATTAT C T G 120 

Qy 121 T AC C CAAAAC AGT AT GAGT GG GT C CT C AT C GC AGC CT AT GT GGCT GT GT T C GT C GT GGC C 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I 1 1 I I I i 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 



Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I | M 

Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 



Qy 361 GTCATCCCCTATCTAGAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCT.CAGCTTCATC 420 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CT CT GT CAT GAAC GC T GGGCAGAT GAC CT C T AT CCCAAGAT CT AC CACAGTT GCT T CTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 CGCGCCT T CCTGGCT GAAGT GAAGCAGAT GCGT GCACGGAGGAAGACAGCCAAGAT GCTG 900 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 CGCGC CT T CCTGGCT GAAGT GAAG CAGAT GC GT GCACGGAGGAAGAC AGC CAAGAT GCT G 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | I | I I I I I I I I I I I I I I I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy > 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 
Db 1081 CTCAGTGG 1088 



RESULT 12 
ABA96020 

ID ABA96020 standard; cDNA; 1133 BP. 



j 



XX 

AC ABA96020; 
XX 

DT 12-MAR-2002 (first entry) 
XX 

DE HGS EST 557082. 
XX 

KW G-protein; receptor; HFGAN72Y; cytostatic; cardiant; analgesic; cancer; 

KW nootropic; tranquillising; neuroprotective; anti-asthmatic; gene therapy; 

KW infection; HIV-1; pain; anorexia; bulimia; Parkinson's disease; ulcer; 

KW cardiac disease; urinary retention; asthma; allergy; psychotic disorder; 

KW benign prostatic hypertrophy; neurological disorder; anxiety; delirium; 

KW schizophrenia; manic depression; dementia; mental retardation; EST; 

KW dyskinesia; Huntington f s disease; Tourette's syndrome; HIV-2; 

KW HGS EST 557082; expressed sequence tag; ss. 
XX 

OS Homo sapiens. 
XX 

PN EP1156110-A2. 

XX 

PD 21-NOV-2001. 
XX 

PF 27-OCT-1997; 2001EP-00203010 . 
XX 

PR 30-APR-1997; 97US-00846705 . 

PR 27-OCT-1997; 97EP-00308554 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis CE; 
XX 

DR WPI; 2002-084320/12. 
XX 

PT New polynucleotide encoding a G-protein coupled receptor designated 

PT HFGAN72Y is useful to diagnose and treat associated diseases including 

PT cancer, infection, cardiac disease and psychotic and neurological 

PT disorders. 
XX 

PS Example 1; Page 18-19; 22pp; English. 
XX 

CC The sequence represents HGS EST 557082. The invention relates to a novel 

CC isolated polynucleotide encoding HFGAN72Y polypeptide. The polypeptide of 

CC the invention has cytostatic, cardiant, analgesic, tranquillising, 

CC nootropic, neuroprotective, and anti-asthmatic activity. The HFGAN72Y has 

CC a use in gene therapy. The HFGAN72Y polynucleotide or an HFGAN72Y 

CC polypeptide agonist are used to treat a subject in need of enhanced 

CC HFGAN72Y activity or expression. An HFGAN72Y antagonist or competitor, or 

CC nucleic acid which inhibits HFGAN72Y expression is used to treat a 

CC subject in need of decreased HFGAN72Y activity or expression. HFGAN72Y- 

CC associated diseases include infections, particularly by HIV-1 or HIV-2, 

CC pain, anorexia, bulimia, Parkinson's disease, cardiac diseases, cancers, 

CC ulcers, urinary retention, asthma, allergies, benign prostatic 

CC hypertrophy, and psychotic and neurological disorders including anxiety, 

CC schizophrenia, manic depression,' delirium, dementia, severe mental 

CC retardation and dyskinesias such as Huntington's disease and Tourette's 

CC syndrome 

XX 



SQ Sequence 1133 BP; 202 A; 366 C; 314 G; 251 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1078.4; DB 6; Length 1133; 

Best Local Similarity 99.4%; Pred. No. 1.4e-245; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 121- TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M II I I I I I I I I I I M II M I I I 

Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1.1 I I I I I I I I I 
Db 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCT^AGATCTACCACAGTTGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 



Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I II I I I I I I II I I I I I 

Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 C GC GCCTT C CT GGCT GAAGT GAAGC AGAT GC GT GCAC G GAG GAAGAC AGC CAAGAT G CT G 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ACCTT CTCCCACT GGCTGGT GTACGCCAACAGCGCTGCCAACCCCAT CAT CTACAACTT C 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 
Db 1081 CTCAGTGG 1088 



RESULT 13 
AAV68511 - 

ID AAV68511 standard; cDNA; 1170 BP. 
XX 

AC AAV68511; 
XX 

DT 29-JAN-1999 (first entry) 
XX 



DE Nucleotide sequence of HFGAN72Y a G-protein coupled receptor. 
XX 

KW G-protein coupled receptor family; HFGAN72Y; mutation; probe; agonist; 

KW antagonist; activation; inhibition; gene therapy; antibody; 

KW immune response; vaccine; HIV-1; HIV-2; cancer; anorexia; bulimia; 

KW asthma; Parkinson ! s disease; acute heart failure; hypotension; 

KW hypertension; urinary retention; osteoporosis; angina pectoris; 

KW myocardial infarction; ulcer; allergies; psychotic disorder; 

KW neurological disorder; gene mapping; ss. 

XX 

OS Homo sapiens. 

XX . 

FH Key Location/Qualifiers 

FT CDS 1. .1170 

FT /*tag= a 

FT /product= "HFGAN72Y protein" 
XX 

PN EP875565-A2. 
XX 

PD 04-NOV-1998. 
XX 

PF 27-OCT-1997; 97EP-00308554 . 
XX 



PR 30-APR-1997; 97US-00846705 . 
-XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
XX 

PI Bergsma DJ, Ellis C; 
XX 

DR WPI; 1998-570286/49. 

DR P-PSDB; AAW80805. 
XX 

PT New G-protein coupled receptor HFGAN72Y polypeptide and polynucleotide - 

PT useful as diagnostic reagents and for prevention and treatment of HIV 

PT infections, cancer, osteoporosis and Parkinson's disease. 
XX 

PS Claim 3; Page 7; 22pp; English. 
XX 

CC This is the nucleotide sequence of the G-protein coupled receptor, 

CC HFGAN72Y used in the method of the invention. HFGAN72Y polypeptides and 

CC polynucleotides are useful for diagnosing susceptibility to diseases by 

CC detecting mutations in the HFGAN72Y gene using probes containing the 

CC HFGAN72Y nucleotide sequence, and can diagnose diseases associated with 

CC HFGAN72Y imbalance by determining HFGAN72Y polypeptide or mRNA expression 

CC levels . Agonists/antagonists can be used in treatment to activate/inhibit 

CC HFGAN72Y activity, in addition to direct administration of antisense 

CC sequences to prevent expression, or HFGAN72Y polypeptides to treat 

CC conditions associated with a lack HFGAN72Y protein. Gene therapy may also 

CC be used to affect endogenous HFGAN72Y polypeptide production. HFGAN72Y 

CC antibodies are useful for inducing an immune response to immunise and 

CC prevent diseases, and for isolating HFGAN72Y clones or purifying the 

CC polypeptides by affinity chromatography. HFGAN72Y polypeptides can be 

CC administered directly or as a vaccine to inoculate against diseases. 

CC Diseases diagnosed, prevented or treated include HIV-1 or HIV-2 

CC infections, pain, cancers, anorexia, bulimia, asthma, Parkinson 1 s 

CC disease, acute heart failure, hypotension, hypertension, urinary 

CC retention, osteoporosis, angina pectoris, myocardial infarction, ulcers; 

CC allergies, benign prostatic hypertrophy, and psychotic and neurological 

CC disorders. The HFGAN72Y polypeptide is also useful for mapping the gene 

CC to a chromosome, allowing gene inheritance to be studied through linkage 

CC analysis 

XX 

SQ Sequence 1170 BP; 208 A; 381 C; 322 G; 259 T; 0 U; 0 Other; 

Query Match 97.2%; Score 1078.4; DB 2; Length 1170; 

Best Local Similarity 99.4%; Pred. No. 1.4e-245; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 T C C C CTGT GC CT C CAGACT AT GAAGAT GAGTTT CT C C GCTAT CT GT GGC GT GAT T AT CT G 120 



Qy 

Db 



121 
121 



180 
180 



Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I || I 
Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

I I I I I I I 1 I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I || I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CT CT GT CAT GAAC GCT GG G C AGAT GAC CT CTATCC CAAGAT CT AC CACAGT TGCTTCTTT 660 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 7 81 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGC AGAT G C GT G C AC GGAGGAAGACAGCCAAGAT G CT G 900 

■ y II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II m I I I I I I I I I 
Db 841 C GC GC CTT C CT G GCT GAAGT GAAG C AGAT GCGT G C AC GGAGGAAGACAGC CAAGAT GCT G 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I M I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 



Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 
Db 1081 CTCAGTGG 1088 



ABA96019; 

12-MAR-2002 (first entry) 

G-protein coupled receptor (HFGAN72Y) cDNA. 

G-protein; receptor; HFGAN72Y; cytostatic; cardiant; analgesic; cancer; 
nootropic; tranquillising; neuroprotective; anti-asthmatic; gene therapy; 
infection; HIV-1; pain; anorexia; bulimia; Parkinson's disease; ulcer; 
cardiac disease; urinary retention; asthma; allergy; psychotic disorder; 
benign prostatic hypertrophy; neurological disorder; anxiety; delirium; 
schizophrenia; manic depression; dementia; mental retardation; 
dyskinesia; Huntington's disease; . Tourette 1 s syndrome; HIV-2; ss. 



Location/Qualifiers 
1. .1170 
/*tag= a 

/product= "HGFAN72Y" 



RESULT 14 
ABA96019 

ID ABA96019 standard; cDNA; 1170 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
PT 
XX 
PS 
XX 
CC 



Homo sapiens . 

Key 
CDS 



EP1156110-A2. 



21-NOV-2001. 



27-OCT-1997; 2001EP-00203010 . 



30-APR-1997; 
27-OCT-1997; 



97US-00846705. 
97EP-00308554. 



(SMIK ) SMITHKLINE BEECHAM CORP. 

Bergsma DJ, Ellis CE; 

WPI; 2002-084320/12. 
P-PSDB; ABB08208. 



New polynucleotide 
HFGAN72Y is useful 
cancer, infection, 
disorders . 



encoding a G-protein coupled receptor designated 
to diagnose and treat associated diseases including 
cardiac disease and psychotic and neurological 



Claim 3; Page 7; 22pp; English. 

The sequence encodes G-protein coupled receptor HFGAN72Y. The invention 



CC relates to a novel isolated polynucleotide encoding HFGAN72Y polypeptide. 

CC The polypeptide of the invention has cytostatic, cardiant, analgesic/ 

CC tranquillising, nootropic, neuroprotective, and anti-asthmatic activity. 

CC The HFGAN72Y has a use in gene therapy. The HFGAN72Y polynucleotide or an 

CC HFGAN72Y polypeptide agonist are used to treat a subject in need of 

CC enhanced HFGAN72Y activity or expression. An HFGAN72Y antagonist or 

CC competitor, or nucleic acid which inhibits HFGAN72Y expression is used to 

CC treat a subject in need of decreased HFGAN72Y activity or expression. 

CC HFGAN72Y-associated diseases include infections, particularly by HIV-1 or 

CC HIV-2, pain, anorexia, bulimia, Parkinson's disease, cardiac diseases, 

CC cancers, ulcers, urinary retention, asthma, allergies, benign prostatic 

CC hypertrophy, and psychotic and neurological disorders including anxiety, 

CC schizophrenia, manic depression, delirium, dementia, severe mental 

CC retardation and dyskinesias such as Huntington's disease and Tourette 1 s 

CC syndrome 

XX 

SQ Sequence 1170 BP; 208 A; 381 C; 322 G; 259 T; 0 U; 0 Other; 



Query Match 97.2%; Score 1078.4; DB 6; Length 1170; 

Best Local Similarity 99.4%; Pred. No. 1.4e-245; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 



Ov 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | 




Db 


l 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 


Qy 


61 


T CCCCT GT G CCT C CAGACT AT GAAGAT GAGT T T CT C C GCT AT CT GT GG C GT GAT TAT CT G 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 




Db 


61 


TCCCCTGTGCCTC CAGAC TAT GAAGAT GAGT T T C T CC GCT AT CT GT GGC GT GATT AT CT G 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I I 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



541 .GCAGTCATGCAATCCAGCAGTGT.GCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CT GT CAT GAACGCTGGGCAGATGACCT CTATCCCAAGAT CTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCT GAAGT GAAG C AGAT GC GT GC AC G GAGGAAGACAGCCAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT G C AC GGAGGAAGACAGCCAAGAT GCT G 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
102 1 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

1081 CTCAGTGG 1088 

I I I I I I I I 

1081 CTCAGTGG 1088 



RESULT 15 
AAS00491 



ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 



AAS00491 standard; cDNA; 1278 BP. 
AAS00491; 

17-MAY-2001 (first entry) 

Human neuropeptide receptor cDNA. 

Human; neuropeptide receptor; neuropeptide Y receptor; obesity; 

nervous system disorder; hyperprolif erative disorder; diabetes mellitus; 

cardiovascular disorder; autoimmune disorder; infectious disorder; 

eating behaviour disorder; narcolepsy; neurological disease; 

narcotics addiction; nicotine addiction; alcohol addiction; gene therapy; 



KW 
XX 
OS 
XX 
FH 
FT 
FT 



protein co-ordinate data; chromosome 1; ss. 



Homo sapiens. 



FT 



Key 
CDS 



Location/Qualif iers 
1. .1278 
/*tag= a 

/product= "neuropeptide receptor 



M 



XX 

PN WO200117532-A1. 
XX 

PD 15-MAR-2001. 
XX 

PF 07-SEP-2000; 2000WO-US024518 . 
XX 

PR 10-SEP-1999; 99US-00393696 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Soppet DR, Li Y, Rosen CA; 
XX 

DR WPI; 2001-183276/18. 

DR P-PSDB; AAU00438*. 
XX 

PT A new nucleic acid encoding a human neuropeptide receptor polypeptide, 

PT useful for preventing, treating or ameliorating obesity, narcolepsy, 

PT neurological disease and addiction to narcotics, nicotine and alcohol. 
XX 

PS Claim 4; Fig 1; 385pp; English. 
XX 

CC The present sequence encodes for a novel human neuropeptide receptor 

CC which shows sequence homology to the neuropeptide Y receptor. Two splice 

CC variants of the neuropeptide receptor (AAU00439-AAU00440) and a possible 

CC mutant (AAU00442) are also described. Polypeptides and polynucleotides of 

CC the neuropeptide receptor are useful for diagnosing, preventing, or 

CC treating a pathological condition in a subject related to the central 

CC nervous and peripheral nervous systems (CNS and PNS) . The polypeptides 

CC and polynucleotides may be used to treat hyperprolif erative, 

CC cardiovascular, autoimmune, nervous system or infectious disorders e.g. 

CC cancer, heart disease, rheumatoid arthritis, Alzheimer f s disease, HIV 

CC infection and diabetes mellitus . In particular they are useful for 

CC preventing, treating or ameliorating a medical condition in a mammal such 

CC as obesity/eating behaviour disorders, narcolepsy, neurological disease, 

CC addiction to narcotics, nicotine and alcohol, chronic pain, acute pain, 

CC migraine headaches and anxiety disorders. The polynucleotides encoding 

CC the neuropeptide receptor can also be used in gene therapy methods for 

CC treating such diseases 

XX 

SQ Sequence 1278 BP; 220 A; 426 C; 347 G; 285 T; 0 U; 0 Other; 

Query Match 97.1%; Score 1077.8; DB 4; Length 1278; 
Best Local Similarity 99.4%; Pred. No. 2e~245; 

Matches 1082; Conservative 0; Mismatches, 7; Indels 0; Gaps 0 



Qy 

Db 



1 AT G GAGC C CT CAG C CAC CC CAG GGG CC CAGAT GGGGGTCCCCC CTGGC AG CAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I > I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



Qy^ 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I I | I I I I I I I I I I I I I I I I I I I I I 
Db 61 TCCCCTGTGCCTC C AGACT AT GAAGAT GAGTT T CT C C GCTAT CT GT GGC GT GATTAT CT G 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I i I I I I I I I I I 

Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 GT CAT CCCCTAT CTACAGGCT GT GT CCGT GT CAGT GGCAGT GCT AACT CT CAGCTT CATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 C T CT GT CAT GAAC GCT GGGCAGAT GAC CT CT AT C CCAAGAT CT AC C ACAGT TGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 GT CT GT GAT GAAC GCT GGG CAGAT GAC CT CT AT C CCAAGAT CT AC C ACAGT T GCTT CT T T 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 A/ N 1 .GCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGPA.GCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 7 81 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGCC 840 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GC AC GGAG GAAGACAGC CAAGATGCT G 900 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 C GC GC CTT C CT GGCT GAAGT GAAGCAGAT G CGT GCAC G GAGGAAGACAGC CAAGATGCT G 900 



Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I | | | | |. | I I I I I | | | | | | | I I I I I I I I I I I I 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1 1 1 1 1 I 1 1 i 1 1 1 1 1 1 1 1 1 I I 1 1 I 1 1 1 1 1 1 I I I 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

Qy 1081 CTCAGTGGC 1089 

MINIMI 
Db 1081 CTCAGTGGC 108 9 

Search completed: October 15, 2004, 16:01:45- 
Job time : 477.138 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



October 15, 2004, 15:25:17 ; Search time 87.0094 Seconds 

(without alignments) 
7079.645 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-070-532-3 
1110 

1 atggagccctcagccacccc ttccctggagtctgctctaa 1110 



Scoring table: I DENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 682709 seqs, 277475446 residues 



Total number of hits satisfying chosen parameters: 1365418 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_NA: * 

1 : /cgn2_6/ptodata/2/ina/5A_COMB. seq: * 

2: /cgn2 6/ptodata/2/ ina/ 5B_C0MB . seq : * 

3 : /cgn2_6/ptodata/2/ina/6A_COMB. seq: * 

4 : /cgn2_6/ptodata/2/ina/6B_COMB. seq: * 

5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq: 

6: /cgn2_6/ptodata/2/ina/backf ilesl . seq: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


1110 
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.0 


1110 


5 


PCT-US95-05616-3 


Sequence 


3, 


Appli 


2 


1102 


99 


.3 


1110 


4 


US-08-462-509B-3 


Sequence 


3, 


Appli 


3 


1079.4 


97 


.2 


1209 


4 


US-08-462-509B-1 


Sequence 


1, 


Appli 


4 


1079.4 


97 


.2 


1564 


2 


US-08-846-705-4 


Sequence 


4, 


Appli 


5 


1079.4 


97 


.2 


^1564 


3 


US-08-846-704-1 


Sequence 


1, 


Appli 


6 


1079.4 


97 


.2 


1564 


3 


US-08-846-704-3 


Sequence 


3, 


Appli 


7 


1078.4 


97 


.2 


1133 


2 


US-08-846-705-3 


Sequence 


3, 


Appli 


8 


1078.4 


97 


.2 


1170 


2 


US-08-846-705-1 


Sequence 


1, 


Appli 


9 


1077.8 


97 


.1 


1209 


5 


PCT-US95-05616-1 


Sequence 


1, 


Appli 


10 


1075.2 


96 


.9 


1116 


4 


US-08-462-509B-5 


Sequence 


5, 


Appli 


11 


1075.2 


96 


.9 


1133 


5 


PCT-US95-05616-5 


Sequence 


5, 


Appli 



12 


688 


62. 


0 


843 


3 


US- 


-08- 


-513- 


-974B-375 


Sequence 


375, App 


13 


661 


59. 


5 


789 


3 


us- 


-08- 


-513- 


-974B-55 


Sequence 


55, Appl 


14 


661 


59. 


5 


789 


4 


us- 


-09- 


-461- 


-436B-55 


Sequence 


55, Appl 


15 


511. 


8 


46. 


1 


1633 


3 


US- 


-09- 


-119- 


-788-1 


Sequence 


1, Appli 


16 


241. 


2 


21. 


7 


9785 


4 


US- 


-09- 


-479- 


-128-1 


Sequence 


1, Appli 


17 


161. 


6 


14. 


6 


1293 


3 


US- 


-09- 


-255- 


-368-7 


Sequence 


7, Appli 


18 


154. 


6 


13. 


9 


1410 


3 


us- 


-09- 


-255- 


-368-1 


Sequence 


1, Appli 


19 


120. 


4 


10. 


8 


168575 


4 


us- 


-09- 


-426- 


■290-1 


Sequence 


1, Appli 


20 


117. 


4 


10. 


6 


1110 


3 


us- 


08- 


-513- 


-974B-31 


Sequence 


31, Appl 


21 


117 . 


4 


10. 


6 


1110 


3 


us- 


08- 


-776- 


•971-26 


Sequence 


26, Appl 


22 


117. 


4 


10. 


6 


1110 


4 


us- 


09- 


-461- 


436B-31 


Sequence 


31, Appl 


23 


117. 


4 


10. 


6 


1331 


3 


us- 


08- 


■513- 


974B-322 


Sequence 


322, App 


24 


117. 


4 


10. 


6 


1331 


3 


us- 


08- 


776- 


971-103 


Sequence 


103, App 


25 


108. 


8 


9. 


8 


1113 


3 


us- 


09- 


172- 


353-1 


Sequence 


1, Appli 


26 


108. 


8 


9. 


8 


1113 


4 


us- 


09- 


799- 


955-1 


Sequence 


1, Appli 


27 


107. 


2 


9. 


7 


669 


3 


us- 


08- 


513- 


974B-314 


Sequence 


314, App 


28 


107. 


2 


9. 


7 


669 


3 


us- 


08- 


776- 


971-99 


Sequence 


99, Appl 


29 


101. 


6 


9. 


2 


1110 


4 


us- 


09- 


170- 


496D-25 


Sequence 


25, Appl 


30 


101. 


6 


9. 


2 


1110 


4 


us- 


09- 


170- 


496D-177 


Sequence 


177, App 


31 


101. 


6 


9. 


2 


1344 


4 


us- 


09- 


016- 


434-1295 


Sequence 


1295, Ap 


32 


101. 


6 


9. 


2 


1356 


1 


us- 


07- 


978- 


892A-4 


Sequence 


4 , Appli 


33 


101. 


6 


9. 


2 


1535 


4 


us- 


09- 


016- 


434-1051 


Sequence 


1051, Ap 


34 


101. 


6 


9. 


2 


1969 


1 


us- 


07- 


937- 


609-28 


Sequence 


28, Appl 


35 


101. 


6 


9. 


2 


1969 


3 


us- 


08- 


029- 


170-28 


Sequence 


28, Appl 


36 


98. 


4 


8. 


9 


1621 


1 


us- 


08- 


722- 


001-13 


Sequence 


13, Appl 


37 


98. 


4 


8. 


9 


1776 


1 


us- 


08- 


722- 


001-29 


Sequence 


2 9, Appl 


38 


98. 


4 


8. 


9 


2002 


4 


us- 


09- 


016- 


434-1172 


Sequence 


1172, Ap 


39 


98. 


4 


8. 


9 


2140 


1 


us- 


08- 


334- 


698-1 


Sequence 


1, Appli 


40 


98. 


4 


8. 


9 


2140 


1 


us- 


08- 


228- 


932-1 


Sequence 


1, Appli 


41 


98. 


4 


8. 


9 


2140 


1 


us- 


08- 


468- 


939-1 


Sequence 


1, Appli 


42 


98. 


4 


8. 


9 


2140 


2 


us- 


08- 


406- 


855A-1 


Sequence 


1, Appli 


43 


98. 


4 


8. 


9 


2140 


2 


us- 


08- 


722- 


190-1 


Sequence 


1, Appli 


44 


98. 


4 


8. 


9 


2140 


3 


us- 


08- 


244- 


354-1 


Sequence 


1, Appli 


45 


98. 


4 


8. 


9 


2140 


3 


us- 


09- 


206- 


899-1 


Sequence 


1, Appli 



ALIGNMENTS 



RESULT 1 

PCT-US95-05616-3 

; Sequence 3, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 



OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: PCT/US95/05616 

FILING DATE: concurrently 

CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 

REFERENCE/ DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: ' 3: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 1110 BASE PAIRS 

TYPE: NUCLEIC ACID 
\ STRANDEDNESS : SINGLE 

TOPOLOGY: LINEAR 
; MOLECULE TYPE: cDNA 
PCT-US95-05616-3 



Query Match 100.0%; Score 1110; DB 5; Length 1110; 

Best Local Similarity 100.0%; Pred. No. 4.1e-255; 

Matches 1110; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 U 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I | | | 




Db 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 


Qy 


61 


T C C CCT GT GCCT C CAGACT AT GAAGAT GAGT T T CT CC GCT AT CT GT GGCGT GATTAT CT G 


120 






1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 I | | | | | | | | | | | M | | | | | | | | | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 I I I I 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


C C C CT GGAC C G CTGGT AT GC CAT CT GC CAC C C ACT ATT GT T CAAGAGCACAGC C CGGC GG 


480 



Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i 
II 1 I" 1 II 1 1 1 1 1 II II 1 II II 1 II II 1 1 II 1 II 1 II 1 II 1 II 1 1 1 II 1 1 1 1 II 1 1 II II 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






i i i i i i i i i i i i i i i \ t i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






i i i i i i i t i i i i i i t i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


CT CT GT CAT GAAC GCTG GGCAGAT GAC CT CTAT C CCAAGAT CTAC CAC AGTT GCTT CT TT 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






i i i i i i i i i i i i i i i i i i i i ■ i i i i i i i i i a ■ i i i i i i i i i i i ■ i i i ■ • i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 [ 1 1 i 1 l 1 1 1 i 1 1 1 1 1 1 t 1 1 1 1 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I I I I 1 i i i i a > i ■ ■ i i i i 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M | | | | | | | | | | | | 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






i i i i i i i i i i i i t i i i i i i i i > i i i t i i i i i i i t i i i i i i i i i i i i i i i i i i > i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


CGC GCCTT CCT GGCT GAAGT GAAGCAGAT GC GT GCAC GGAGGAAGACAGC CAAGAT GCT G 


900 






lllllllltllllllltlltlltirilllllllllllltltltllaaiiiiia i i 

1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 | | | | | | | | M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I | | | | | | | | | 




Db 


841 


CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 i 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 i 1 I I I I I I I I I I I I i i t i i a i i i i > ■ 
1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I 




Db 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 


Qy 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1080 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 U 1 1 II 1 1 1 1 




Db 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1080 


Qy 


1081 


CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1.1 1 1 1 I 1 




Db 


1081 


CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 





RESULT 2 

US-08-462-509B-3 

; Sequence 3, Application US/08462509B 
; Patent No. 6410701 
; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
' CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 



CITY: Rockiville 
; STATE: MD 

COUNTRY: USA 

ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
ATTORNEY/ AGENT INFORMATION: 

NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 

REFERENCE/ DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-309-8504 

TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1110 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
; FEATURE: 
; NAME/KEY: CDS 

LOCATION: 1..1110 
US-08-462-509B-3 

Query Match 99.3%; Score 1102; DB 4; Length 1110; 

Best Local Similarity 99.5%; Pred. No. 3.3e-253; 

Matches 1105; Conservative 0; Mismatches 5; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 

Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 
I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I > I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | || 
541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I l-l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 GT CT GT GAT GAAC G CT GGGC AGAT GAC CT CT AT C C CAAGAT CT AC CAC AGT TGCTTCTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I U I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I I 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

MINIUM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 3 

US-08-462-509B-1 

; Sequence 1, Application US/08462509B 

; Patent No. 6410701 

; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al < 
TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
CITY: Rockiville 
STATE: MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/4 62 , 509B 
FILING DATE: 05-JUN-1995 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/ 05616 
FILING DATE: 05-MAY-1995 
; ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/ DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1209 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: single 

; TOPOLOGY: linear 

MOLECULE TYPE: DNA (genomic) 
FEATURE : 

NAME/ KEY: CDS 
; LOCATION: ±\ .1209 

US-08-462-509B-1 

Query Match 97.2%; Score 1079.4; DB 4; Length 1209; 

Best Local Similarity 99.4%; Pred. No. 8.1e-248; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 
Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



Qy 



61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


61 


TCCCCTGTGCCTC CAGACTAT GAAGAT GAGT TT CT C C G CT AT CT GT GGC GTGAT TAT CT G 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






i i i i i i i i i i i i ■ i i i i ■ i i i i i i i i i i i i i i i i i i i i i < i i i i ■ i 

II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






lllllllllllltllllllllllllllllllllllllllilll»lllittaftll*4«i*i 

1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 I I I I I II 1 I 1 I I | | || | | | | | || | || | || | | 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 I I 




Db 


361 


GT CAT C C C CT AT CT ACAGGCTGT GTC C GT GT CAGT GG C AGT GCTAACT CT CAGCT T CAT C 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGT CAT GCAAT C C AGCAGT GT G CTGCCT GAGCT AGC CAAC C GCACACG GCT CT T CT C A 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCTyVGATCTACCACAGTTGCTTCTTT 


660 






1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | 




Db 


601 


GT CT GT GAT GAACGCT GGGCAGAT GAC CT CTAT C C CAAGAT CT ACCACAGTT GCTTCT T T 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 | | | 1 1 1 i i i i j t i i j i i i i i i ■ i i i i i i i ■ i i i i i , * , , , , , * ,t i * > ■ * • . 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






L 1 1 1 1 I | \ 1 1 1 1 I | | 1 1 t 1 1 1 1 I t 1 1 1 1 1 t J 1 t 1 1 1 1 I I t Illlllliaiti 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


CGCGCCTT CCT GGCT GAAGT GAAGCAGAT GCGTGCAC GGAGGAAGACAGCCAAGAT GCT G 


900 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


C GC GC CTT C CT GGCT GAAGT GAAGCAGAT GC GTGCACGGAGGAAGAC AGC CAAGAT GCT G 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 



1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 



I 



Db 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 



Qy 



Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 



Db 



1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 AC CT T CTC C CACT GG CT GGT GT AC GC CAACAGC GCT GC CAAC C C CAT CAT CT ACAACTT C 1080 



Qy 



1081 CTCAGTGGC 1089 



Db 



1081 CTCAGTGGC 1089 



RESULT 4 
US-08-846-705-4 

; Sequence 4, Application US/08846705 

; Patent No. 5935814 

; GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 
APPLICANT: ELLIS, CATHERINE E 
TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 
; NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 
STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
STATE: PA 
COUNTRY: USA 
ZIP: 19482 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846,705 

FILING DATE: 30-APR-1997 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
; FILING DATE: 

; ATTORNEY/AGENT INFORMATION: 
; NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 ... 

REFERENCE/ DOCKET NUMBER: GH-70003 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 

TELEFAX: 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 1564 base pairs 

; TYPE: nucleic acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 



MOLECULE TYPE: cDNA 
US-08-846-705-4 



Query Match 97.2%; Score 1079.4; DB 2; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.7e-248; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I IMM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCC.CTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 54 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

,111111111 Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I 

Db 754 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

INI M III I I I II I I I II I I I I || I || IIIIIMI I I I MM I I Mill 

Db .814 AT T GT CACCT ACCT GGC CC CACT GGGC CT CAT GGC CAT GGC CT AT TT C CAGAT AT TC C GC 873 



Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

MINIMI I N I I I I I I I I I I I II I II II II I I II II II II 



Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GCAC GGAGGAAGAC AGC CAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 994 CGCGCCTT C GTGGCT GAAGT GAAGCAGAT GCGT GCACGGAGGAAGAC AGC CAAGAT GCT G 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | I I I I I I I I I I I I I I I I I I I | I | | | 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I M I I I I I I I I I I I I I I I I I I I I I I I I MINIUM I I II I || I I I I I I 

Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I M I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I II I I I I II 
Db 1234 CTCAGTGGC 1242 



RESULT 5 
US-08-846-704-1 

; Sequence 1, Application US/08846704 

; Patent No. 6020157 

; GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 

APPLICANT: ELLIS, CATHERINE E. 

TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 

NUMBER OF SEQUENCES: 4 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 
STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
; STATE: PA 

; COUNTRY: USA 

; ZIP: 19482 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846,704 
FILING DATE: 30-APR-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: PRESTIA, PAUL F 



REGISTRATION NUMBER: 23,031 
; REFERENCE/ DOCKET NUMBER: GH-70002 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-407-0700 
TELEFAX: 610-407-0701 
; TELEX: 846169 

; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1564 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: single 

; TOPOLOGY: linear 

MOLECULE TYPE: cDNA 
US-08-846-704-1 



Query Match 97.2%; Score 1079.4; DB 3; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.7e-248; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 



Qv 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


fid 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 i 1 i i 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 
■ i i i i i i i i i i i i i i i i i i i i i i i i i i j i i i i i \ \ i i i i i i i i i i i i j i [ i i i i i i i i i j 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qv 


61 


TC C C CT GT GC CT C CAGACT AT GAAGAT GAGTTT CT CC GCT AT C T GT GGC<TT C AT T AT CT a 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

i i i ' i i i i I I I I l t 1 1 1 1 1 1 t 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 I I I 1 I 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 I I 




Db 


214 


TCCCCTGTGCCTCCAGACTATG7UVGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


273 


Qy 


121 


TAC C CAAAAC AGT AT GAGT GG GT C CT CAT CG C AGC CT ATGT GGCT GT GTT C GT C GTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 
• * • » ■ ■ ■ ■ ■ ii i i i i * i i i i i i i i \ i i i i i i i i i i i i i \ \ i i i i i i i i i i i i i i i i i i i i 




Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M | | | | | | M 1 1 1 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


514 


GT C AT CC C CT AT C TACAGGCT GTGT CC GT GT CAGT GGC AGT GCTAACT CT CAGC1 x'CAT C 


573 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


574 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


633 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


634 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


693 



Qy 



541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 
I I I I I I I I I HI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CT CT GT CAT GAACGCTGGGCAGATGACCTCTAT CCCAAGAT CTACCACAGTT GCTT CTTT 660 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 754 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I 
Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | II I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 C GC GC CTT C CT GGCTGAAGT GAAGCAGAT GCGT GCAC GGAGGAAGACAG C CAAGAT GCT G 900 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 994 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT G CAC GGAGGAAGACAGCCAAGAT GCT G 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I | | | 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1234 CTCAGTGGC 1242 



RESULT 6 
US-08-846-704-3 

; Sequence 3, Application US/08846704 
; Patent No. 6020157 
; GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 
; APPLICANT: ELLIS, CATHERINE E. 
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 
NUMBER OF SEQUENCES: 4 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 
STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
STATE: PA 
COUNTRY: USA 
ZIP: 19482 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 



COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/84 6,7 04 

FILING DATE: 30-APR-1997 
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: PRESTIA, PAUL F 

REGISTRATION NUMBER: 23,031 
; REFERENCE/DOCKET NUMBER: GH-70002 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 
; TELEFAX: 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 3: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 1564 base pairs 

; TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-846-704-3 



Query Match 97.2%; Score 1079.4; DB 3; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 8.7e-248; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 



Qy 


1 


AT GGAG CC CT CAGCCACC C CAGGGGC C CAGAT GG GGGTCCCCCCT GG CAGCAGAGAGC CG 


60 






M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I 




Db 


154 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


213 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | 




Db 


214 


T C CCCT GT GCCT CC AGACT AT GAAGAT GAGT T T CT CC GCT AT CT GT GGC GT GATT AT CT G 


273 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 M 1 1 1 1 1 1 




Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | II 1 1 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 



Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | 



Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 754 GT CTGTGAT GAACGCTGGGCAGAT GACCTCTATCCCAAGATCTACCACAGTT GCTT CTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 874 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 993 

Qy 841 C GCGCCTT CCTGGCT GAAGT GAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGAT GCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
Db 1234 CTCAGTGGC 1242 

RESULT 7 
US-08-846-705-3 

; Sequence 3, Application US/08846705 

; Patent No. 5935814 

; GENERAL INFORMATION: 

APPLICANT: BERGSMA, DERK J. 



APPLICANT: ELLIS, CATHERINE E 
; TITLE OF INVENTION: NOVEL G-PROTEIN COUPLED 

NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: RATNER & PRESTIA 

STREET: P.O. BOX 980 
CITY: VALLEY FORGE 
STATE: PA 
COUNTRY: USA 
ZIP: 19482 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 08/ 84 6, 1 05 

FILING DATE: 30-APR-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 
NAME: PRESTIA, PAUL F 
REGISTRATION NUMBER: 23,031 
; REFERENCE/ DOCKET NUMBER: GH-70003 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-407-0700 
TELEFAX: 610-407-0701 
TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 1133 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-846-705-3 

Query Match 97.2%; Score 1078.4; DB 2; Length 1133; 

Best Local Similarity 99.4%; Pred. No. 1.4e-247; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 . 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db , r« 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 12 0 

II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I II I I I I I I I I II I I I I I I I I I I II 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 48 0 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 

541 G CAGT CAT GCAAT CCAGCAGT GT GCT GCCT GAG CT AGCCAACC G CAC AC GGCT CTT CT CA 600 

MINIMI III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CTGT CAT GAACGCTGGGCAGATGACCT CTAT CCCAAGATCTACCACAGTTGCTT CTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I | | | | | | | | | | | | | | | | | | | | | 
601 GT CTGT GAT GAACGCTGGGCAGATGACCT CTAT CCCAAGATCTACCACAGTTGCTT CTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I 

841 C GC GC CT TC CT GGCTGAAGT GAAGCAGAT G C GT G CAC GGAGGAAGACAGC CAAGAT GCT G 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 



Qy 1081 CTCAGTGG 1088 

I I I I I I I I 
Db 1081 CTCAGTGG 1088 



RESULT 8 
US-08-846-705-1 

; Sequence 1, Application US/08846705 

; Patent No. 5935814 

; GENERAL INFORMATION : 

; APPLICANT: BERGSMA, DERK J. 

APPLICANT: ELLIS, CATHERINE E 
; TITLE OF INVENTION: NOVEL G- PROTEIN COUPLED 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: RATNER & PRESTIA 

STREET: P.O. BOX 980 

CITY: VALLEY FORGE 

STATE: PA 

COUNTRY: USA 

ZIP: 19482 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ' for Windows Version 2.0 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/846, 705 

; FILING DATE: 30-APR-1997 

; CLASSIFICATION: 435 

; PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/ AGENT INFORMATION: 

NAME: PRESTIA, PAUL F 
; . REGISTRATION NUMBER: 23,031 

REFERENCE/DOCKET NUMBER: GH-70003 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-407-0700 

TELEFAX : 610-407-0701 

TELEX: 846169 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1170 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
; MOLECULE TYPE: cDNA 
US-08-846-705-1 

Query Match 97.2%; Score 1078.4; DB 2; Length 1170; 

Best Local Similarity 99.4%; Pred. No. 1.4e-247; 

Matches 1082; Conservative 0; Mismatches 6; Indels 0; Gaps 0 



Qy 



1 % ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 
. I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

61 T C C C CT GT GCCT C CAGACT AT GAAGAT GAGT T TCTC C GCT AT CT GT GGC GT GAT TAT CT G 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGTT T CT C C GC TAT CT GT GGC GT GATT AT CTG 12 0 

121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

301- CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GT CAT C C C CT AT CT ACAGGCT GT GT C C GTGT C AGT GGCAGTGCT AACTCT CAGCT T CAT C 420 

I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

421 GC C CT GGAC C GCT GGT AT G C C ATCT GC CACC CACT ATT GT T CAAGAG CAC AGC C C GGC GG 48 0 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CT GT CAT GAAC GCT GGGC AGAT GAC CTCTAT C C CAAGAT CT AC C ACAGT TGCTTCTTT 660 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 

601 GT CT GT GAT GAAC GCT GGGCAGAT GAC CTCTAT CCCAAGAT CTAC CACAGT T GCTT CTT T 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

781 CCCTCAGAC CAGCT GGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCTGAAGT GAAGCAGAT GCGT GCACGGAGGAAGACAGCCAAGATGCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 



Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 

Db 1081 CTCAGTGG 1088 



RESULT 9 

PCT-US95-05616-1 

; Sequence 1, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

; STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 
; STATE: NEW JERSEY 

COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: WORD PERFECT 5.1 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: PCT/US95/05616 

; FILING DATE: concurrently 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: FERRARO, GREGORY D. 
REGISTRATION NUMBER: 36,134 
REFERENCE/ DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1209 BASE PAIRS 
TYPE: NUCLEIC ACID 
STRANDEDNESS: SINGLE 
TOPOLOGY: LINEAR 
MOLECULE TYPE: cDNA 
PCT-US95-05616-1 



Query Match 97.1%; Score 1077.8; DB 5; Length 1209; 

Best Local Similarity 99.4%; Pred. No. 1.9e-247; 

Matches 1082; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I LI I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 TCCCCTGTGCCTC CAGACT AT GAAGAT GAGTT T CT C CGCT AT CT GT GGC GT GATTAT CT G 120 

Qy 121 TAGCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCCCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 GC CCT GGAC C GCT GGT AT GC CAT CTGC CAC C CACTAT TGT T CAAGAG CACAGC C C GG C GG 480 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CT CT GT CAT GAAC GCT GGGCAGAT GAC CT CT AT C CCAAG AT CT ACCACAGT TGCTTCTTT 660 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 GT CT GT CAT GAAC GCT G GGCAGAT GAC CT CTAT C CCAAGAT CT AC CACAGT TGCTTCTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 721 AACCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 



Qy 

Db 



781 
781 



840 
840 



Qy 



841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GCGT GCAC GGAGGAAGACAGCCAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 



Db 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 





Db 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 



Qy 



Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 



Db 




Qy 



1081 CTCAGTGGC 1089 

I I I I I I I I I 
1081 CTCAGTGGC 1089 



Db 



RESULT 10 
US-08-462-509B-5 

; Sequence 5, Application US/08462509B 
; Patent No. 6410701 
; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
; , NUMBER OF SEQUENCES : 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockiville 

STATE : MD 

COUNTRY: USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/462, 509B 

FILING DATE: 05-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/US95/05616 

FILING DATE: 05-MAY-1995 
; ATTORNEY/AGENT INFORMATION: 

NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 

REFERENCE/DOCKET NUMBER: PF168P1 
TELECOMMUNICATION INFORMATION: 



TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 

; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1116 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 

; FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1..1116 

US-08-462-509B-5 



Query Match 96.9%; Score 1075.2; DB 4; Length 1116; 

Best Local Similarity 99.3%; Pred. No. 7.9e-247; 

Matches 1080; Conservative 0; Mismatches 8; Indels 0; Gaps 0; 



Qy 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 i i i i i i i i i i i i i [ i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i t i t 'i ii 
1 II 1 M 1 1 II M 1 1 II II 1 M II II II 1 II 1 II 1 1 M II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 II 






1 

J. 


j\i llalllli LALLLALLLLALLLLLLLALA1 LLLLL1 LllCCCTGGCAGCAGAGACCCC 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






I t 1 1 1 1 1 t 1 1 1 1 1 i 1 1 1 1 1 1 ! 1 1 1 t 1 1 1 1 1 1 I 1 1 1 1 1 l l l l I i l i i i i i i i i i i i i i i i i 

II 1 1 M M M 1 1 1 II II 1 1 1 i II II 1 II 1 1 II 1 II II 1 II I I || | | | | 1 | 1 | || | | | Ml 






u ± 


1 L.^LLi Ll LLLl LLALAL 1 Al LAALA1 LALi 1 1 LI LLLLI Al Li Ll LLLLi GATTATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 i 1 1 1 1 1 1 1 1 1 1 1 i t 1 i 1 t 1 1 1 I 1 1 1 1 1 1 1 l l l l l I I I I I I I I I I I I i i i i i i t i > i i i 
1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


1 A^^^/\rt/Vi\^Au 1. /\i KjJ-\Kj 1 LjoLI LL1 LAI LLLALLLlAi LI LLL 1 LI LI 1 LL1 LL1 LLLL 


ion 
lb U 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 

1 1 1 1 >> 1 II 1 I II II II I M II 1 II II M II II II I I 1 M M | M M M M M | | | | | | | 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCT^GCC^T^Tnnr^^AArrArrAraTrnrr ArarTr 


*i U 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | | 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 > 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | I | | | | | | | | | | | | | | | 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 



Qy 601 CT CT GT CAT GAAC GCT GGGCAGAT G AC CT CT AT C C CAAGAT CT AC CACAGTT GCTT CT TT 660 

Mill I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GT C T GT GAT GAAC GCT G GGC AGATGAC CT CT AT CC CAAGAT CT AC CACAGTT G CTT CT TT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 72 0 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

■ Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTG.CCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I II II I II 
Db 1081 CTCAGTGG 1088 



RESULT 11 
PCT-US95-05616-5 

; Sequence 5, Application PC/TUS9505616 
; GENERAL INFORMATION: 

APPLICANT: LI, ET AL. 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE : CARELLA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND 
; STATE: NEW JERSEY 

COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
; OPERATING SYSTEM: MS-DOS 



SOFTWARE: WORD PERFECT 5.1 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/05616 

FILING DATE: concurrently 

CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 

REFERENCE/DOCKET NUMBER: 325800-268 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1133 BASE PAIRS 

TYPE: NUCLEIC ACID 

STRANDEDNESS: SINGLE 
; TOPOLOGY: LINEAR 

MOLECULE TYPE: cDNA 
PCT-US95-05616-5 

Query Match 96.9%; Score 1075.2; DB 5; Length 1133; 

Best Local Similarity 99.3%; Pred. No. 7.9e-247; 

Matches 1080; Conservative 0; Mismatches 8; Indels 0; Gaps , 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGACCCC 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | 
Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 ACC7\ACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTerGCAAG 360 

I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I 
Db 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 



Qy 

Db 



481 
481 



GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 



Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

Qy 601 CT CTGT CATGAACGCT GGGCAGAT GACCTCT ATCCCAAGAT CTACCACAGTT GCTT CTTT 660 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CT AT CC CAAGAT CTAC C ACAGT T GCT T CTTT 660 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 7 80 

I II I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | I I I I I I I 
Db 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

Qy 7 81 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT G C GT GCAC GGAGGAAGACAGC CAAGAT GCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGATGC GT GC AC GGAG GAAGACAGC CAAGAT GCT G 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I II 
Db 1081 CTCAGTGG 1088 



RESULT 12 , S;? 
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APPLICATION NUMBER: JP 6-326611 
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PRIOR APPLICATION DATA: 
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PRIOR APPLICATION DATA: , 

APPLICATION NUMBER: JP 6-189273 
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; TELEFAX: 617-523-6440 

; INFORMATION FOR SEQ ID NO: 375: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 843 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 28. .816 
US-08-513-974B-375 

Query Match 62.0%; Score 688; DB 3; Length 843; 

Best Local Similarity 89.2%; Pred. No. le-154; 

Matches 742; Conservative 0; Mismatches 90; Indels 0; Gaps 0 

Qy 252 CATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCT 311 

I I Ml I I I I I I I I I I I II II I I I I I I I I II I I I I II I I I I I I I I I I I II 
Db 9 CGTGTTCATCCTGTCACTGGCCGATGTGCTGGTGACAGCCATCTGCCTGCCGGCCAGTCT 68 

Qy 312 GCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTA 371 

I M I I I I I I I I I II II I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 69 GCTGGTAGACATCACGGAATCCTGGCTCTTTGGCCATGCCCTCTGCAAGGTCATCCCCTA 128 

Qy 372 TCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCG 431 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 129 TCTACAGGCCGTGTCCGTGTCAGTGGTCGTGCTGACTCTCAGCTCCATCGCCCTGGACCG 188 

Qy 432 CTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTC 491 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II I I I I I I I I I I I Mill 
Db 189 CTGGTACGCCATCTGCCACCCGCTGTTGTTCAAGAGCACTGCCCGGCGCGCCCGCGGCTC 248 

Qy 4 92 CATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCA 551 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I II I I I I I I I I I II I I I I I 
Db 24 9 CATCCTCGGCATCTGGGCGGTGTCGCTGGCTGTCATGGTGCCTCAGGCTGCTGTCATGGA 308 

Qy 552 ATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGA 611 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I II I II I I I I I I | I I 
Db 309 GTGTAGCAGCGTGCTGCCCGAGCTGGCCAACCGCACCCGCCTCCTGTCTGTCTGTGATGA 368 

Qy . 612 ACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTA 671 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I 
Db 369 GC GCTGGGCAGACGAC CT GT AC C CCAAGAT C TAC CACAGCT GCTT CT T CAT T GT CAC CT A 428 

Qy 672 CCTGGChcCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGG 731 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 429 CCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATCTTCCGCAAGCTCTGGGG 488 

Qy 732 CCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCA 791 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I I I I I I I II I I I 
Db 489 CCGCCAGATCCCCGGCACCACCTCGGCCCTGGTGCGCAACTGGAAGCGGCCCTCAGACCA 548 

Qy 792 GCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCT 851 

I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 54 9 GCTGGACGACCAGGGCCAGGGCCTGAGCTCAGAGCCCCAGCCCCGGGCCCGCGCCTTCCT 608 



Qy 852 G G C T GAAGT GAAGC AGAT GC GT GC AC GGAGGAAGACAGC CAAGAT G CT GAT GGT GGT GC T 911 

I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I II M I I 
Db 609 GGC C GAGGT GAAACAGAT GC GAGC C C GGAGGAAGAC GGCCAAGAT GCT GAT G GT GGT GCT 668 

Qy 912 GCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTT 971 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 669 GCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGTGTCCTCAACGTCCTCAAGAGGGTCTT 728 

Qy 972 CGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCA 1031 

I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 72 9 CGGGATGTTCCGCCAAGCCAGCGACCGAGAGGCCATCTACGCCTGCTTCACCTTCTCCCA 788 

Qy 1032 CTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTC 1083 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I 
Db 789 CTGGCTGGTGTACGCCAACAGCGCCGCCAATCCCCTCCTCTACTCCTTCCTC 8 40 
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US-08-513-974B-55 

Query Match 59.5%; Score 661; DB 3; Length 789; 

Best Local Similarity 89.9%; Pred. No. 2.6e-148; 

Matches 709; Conservative 0; Mismatches 80; Indels 0; Gaps 0; 



Qy 271 GCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGGACATCACTGAG 330 

II II II I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I II 
Db 1 GCCGAT GT GCT GGTGACAGCCAT CT GCCTS'CCGGCCAGT CTGCT GGTAGACAT CACGGAA 60 

Qy 331 TCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCTGTGTCCGTG 390 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 TCCTGGCTCTTTGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCCGTGTCCGTG 120 

Qy 391 TCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATGCCATCTGCCAC 450 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TCAGTGGTCGTGCTGACTCTCAGCTCCATCGCCCTGGACCGCTGGTACGCCATCTGCCAC 180 

Qy 451 CCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGGGCATCTGGGCT 510 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 



Db 



181 CCGCTGTTGTTCAAGAGCACTGCCCGGCGCGCCCGCGGCTCCATCCTCGGCATCTGGGCG 240 



Qy 511 GTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCT 570 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GTGTCGCTGGCTGTCATGGTGCCTCAGGCTGCTGTCATGGAGTGTAGCAGCGTGCTGCCC 300 

Qy 571 GAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGGCAGATGACCTC 630 

I I I I I I I I I I I I I I I I II III I II I I I I I I II I I I I I I I I I I I I I I II I 

Db 301 GAGCTGGCCAACCGCACCCGCCTCCTGTCTGTCTGTGATGAGCGCTGGGCAGACGACCTG 360 

Qy 631 TATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTC 690 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 TACCCCAAGATCTACCACAGCTGCTTCTTCATTGTCACCTACCTGGCCCCACTGGGCCTC 420 

Qy 691 ATGGCCATGGCCTATTTCGAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 750 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ATGGCCATGGCCTATTTCCAGATCTTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 480 

Qy 751 ACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAG 810 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II Ml 
Db 481 ACCTCGGCCCTGGTGCGCAACTGGAAGCGGCCCTCAGACCAGCTGGACGACCAGGGCCAG 540 

Qy 811 GGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATG 870 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I 

Db 541 GGCCTGAGCTCAGAGCCCCAGCCCCGGGCCCGCGCCTTCCTGGCCGAGGTGAAACAGATG 600 

Qy 871 CGTGCACGGAGGAAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 930 

II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 CGAGCCCGGAGGAAGACGGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 660 

Qy 931 TACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCC 990 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 TACCTGCCCATCAGTGTCCTCAACGTCCTCAAGAGGGTCTTCGGGATGTTCCGCCAAGCC 720 

Qy 991 AGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 1050 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 AGCGACCGAGAGGCCATCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 780 

Qy 1051 AGCGCTGCC 1059 

Mill III 
Db 781 AGCGCCGCC 789 
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US-09-461-436B-55 

; Sequence 55, Application US/09461436B 
; Patent No. 6538107 

GENERAL INFORMATION: 

APPLICANT: Shuji Hinuma 
; Yasuaki Ito 

; Ryo Fujii 

TITLE OF INVENTION: G Protein Coupled Receptor Protein, 
; Production, And Use Thereof 

NUMBER OF SEQUENCES: 61 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Edwards & Angell, LLP 
STREET: 101 Federal Street 



CITY: BOSTON 
; STATE: MA 

COUNTRY: USA 

ZIP: 02209 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/461, 436B 

FILING DATE: 14-Dec-1999 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/513,974 

FILING DATE: 14-SEP-1995 

APPLICATION NUMBER: PCT/ JP95/01599 

FILING DATE: 10-AUG-1995 
; APPLICATION NUMBER: 7-093989 

; FILING DATE: 19-APR-1995 

APPLICATION NUMBER: 7-057186 

FILING DATE: 16-MAR-1995 

APPLICATION NUMBER: 7-007177 

FILING DATE: 20-JAN-1995 

APPLICATION NUMBER: 6-326611 

FILING DATE: 28-DEC-1994 

APPLICATION NUMBER: 6-270017 
; FILING DATE: 02-NOV-1994 

APPLICATION NUMBER: 6-236357 
; FILING DATE: 30-SEP-1994 

APPLICATION NUMBER: 6-236356 

FILING DATE: 30-SEP-1994 

APPLICATION NUMBER: 6-189274 

FILING DATE: ll-AUG-1994 

APPLICATION NUMBER: 6-189273 
; FILING DATE: ll-AUG-1994 

APPLICATION NUMBER: 6-189272 

FILING DATE: ll-AUG-1994 
ATTORNEY/AGENT INFORMATION: 

NAME : CONLIN, DAVID G. 
; REGISTRATION NUMBER: <Unknown> 

REFERENCE/ DOCKET NUMBER: 45753 DIV2 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 617-439-4444 

TELEFAX: 617-439-4170 
; INFORMATION FOR SEQ ID NO: 55: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 789 base pairs 

TYPE: nucleic acid 
; STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
US-09-461-436B-55 



Query Match 59.5%; Score 661; DB 4; Length 789; 

Best Local Similarity 89.9%; Pred. No. 2.6e-148; 



Matches 709; Conservative 0; Mismatches 80; Indels 0; Gaps 0; 

Qy 271 GCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGGACATCACTGAG 330 

M M M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1 G CC GAT GT GC T GGT GAC AGCC AT CTGCCTGCCGGC CAGT CTGCT GGT AGAC AT CAC GGAA 60 

Qy 331 TCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCTGTGTCCGTG 390 

I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TCCTGGCTCTTTGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCCGTGTCCGTG 120 

Qy 391 TCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATGCCATCTGCCAC 450 

I I I I I I I till! I I I I I I I I I I II I I I I I I I I I I I I I II I II I I I I I I I I I II I 
Db 121 TCAGTGGTCGTGCTGACTCTCAGCTCCATCGCCCTGGACCGCTGGTACGCCATCTGCCAC 18 0 

Qy 451 CCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGGGCATCTGGGCT 510 

M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCGCTGTTGTTCAAGAGCACTGCCCGGCGCGCCCGCGGCTCCATCCTCGGCATCTGGGCG 240 

Qy 511 GTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCT 570 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 241 GTGTCGCTGGCTGTCATGGTGCCTCAGGCTGCTGTCATGGAGTGTAGCAGCGTGCTGCCC 300 

Qy 571 GAGCTAGCCAACCGCACACGGCT CTTCT CACT CT GT CATGAACGCTGGGCAGATGACCTC 630 

I I I I I I I I I I II I I II II III I II Mill MM I I M II I II M I II I I 

Db 301 GAGCTGGCCAACCGCACCCGCCTCCTGTCTGTCTGTGATGAGCGCTGGGCAGACGACCTG 360 

Qy 631 TATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTC 690 

M I I I I I II I I I II I I II I I I I I I II I II I II I I I I I I II M I I I II I I II I I I I M 
Db 361 TACCCCAAGATCTACCACAGCTGCTTCTTCATTGTCACCTACCTGGCCCCACTGGGCCTC 420 

Qy 691 ATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 750 

M I I I I II I I I I I I I I II I I I II M I I I I I I I I I I I I I I || I I I I I || I II I I I II II I 
Db 421 ATGGCCATGGCCTATTTCCAGATCTTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 480 

Qy 751 ACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAG 810 

M II I II M I I I I I I I I I I I II I I II I I I M I I I I I I M I I I I I II II III 
Db 481 ACCTCGGCCCTGGTGCGCAACTGGAAGCGGCCCTCAGACCAGCTGGACGACCAGGGCCAG 540 

Qy 811 GGCCTGAGTGGAGAGCCCCAGcdcCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATG 870 

I M I I I I I I I II II I I I I I I I I I II I I II I I I I II I II I I II I I I I I I II I I I 
Db 541 GGCCTGAGCTCAGAGCCCCAGCCCCGGGCCCGCGCCTTCCTGGCCGAGGTGAAACAGATG 600 

Qy 871 CGTGCACGGAGGAAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 930 

M II II I I I I I I I II I I I I I II I II I I I I I I I M I I I I I I || I I I I I I I I I I I II I I 
Db 601 CGAGCCCGGAGGAAGACGGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 660 

Qy 931 TACCTCCCCATCAGCGTCCTCAATGTCCTT/yVGAGGGTGTTCGGGATGTTCCGCCAAGCC 990 

M II I I I I II I II II I I II I I II I I I I II M II I II M I I I.I I II II II I I I II I 
Db 661 TACCTGCCCATCAGTGTCCTCAACGTCCTCAAGAGGGTCTTCGGGATGTTCCGCCAAGCC 720 

Qy 991 AGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 1050 

M Mill II II I II I II I II II I I II || I I || || || I I II I I I II II I I II I I II 
Db 721 AGCGACCGAGAGGCCATCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 780 

Qy 1051 AGCGCTGCC 1059 

Mill Ml 
Db 781 AGCGCCGCC 78 9 



RESULT 15 
US-09-119-788-1 

; Sequence 1, Application US/09119788 
; Patent No. 6166193 
; GENERAL INFORMATION: 

APPLICANT: Yanagisawa, Masashi 

TITLE OF INVENTION: CDNA CLONE MY1 THAT ENCODES 
; TITLE OF INVENTION: A NOVEL HUMAN 7 -TRANSMEMBRANE RECEPTOR 

; NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: SmithKline Beecham Corporation 

; STREET: 709 Swedeland Road 

CITY: King of Prussia 
STATE: PA 

COUNTRY: United States of America 
; ZIP: 19406 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ for Windows Version 2.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/119,788 

FILING DATE: 21-JUL-1998 

CLASSIFICATION: 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/053,790 

FILING DATE: 25-JUL-1997 
ATTORNEY/AGENT INFORMATION: 

NAME: King, William T 

REGISTRATION NUMBER: 30,954 

REFERENCE/ DOCKET NUMBER: GH50029 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-270-5515 

TELEFAX: 610-270-5090 
; TELEX: 

; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1633 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
US-09-119-788-1 . 

Query Match 46.1%; Score 511.8; DB 3; Length 1633; 

Best Local Similarity 70.0%; Pred. No. le-112; 

Matches 706; Conservative 0; Mismatches 297; Indels 6; Gaps 1 

Qy 80 ATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTGTACCCAAAACAGTATGAGT 139 

I I I I I I I III I I I I I I I I I I I I I I I I I I I | | | | | 

Db 217 AC GAC GAGGAATT C CT GCGGT AC CT GT GGAGGGAAT AC CT GCAC C CGAAAGAAT AT GAGT 27 6 



Qy 14 0 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I 



277 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 336 

200 TCTGCCTGGCCGTGTGGCG GAAC C AC C AC AT GAG GACAGT CAC CAACT ACTT CAT T GT CA 259 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
337 TTTGT GT GGCAGT GTGGAAGAACCACCACATGAGGACGGTAACCAACTACTT CATAGTCA 396 

260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

397 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 456 

320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I I I I I II II II I I I I I I I I I II II II I I I I I I I I I 
457 AT AT CACT GAGAC CT GGT T TTT TGGACAGT C C CT T T GCAAAGT GAT T C CTTAT CT ACAGA 516 

380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I III I II II II II I II I III II I I I I II I I I I I I I 
517 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 576 

44 0 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 4 99 

I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 
577 CAAT CT GT CAC C C T T T GAT GTT T AAGAGCACAGCAAAGC G GGC CCGT AACAGCAT T GT CA 636 

500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

637 TCATCT GGATTGT CT CCTGCATTATAAT GATTCCTCAGGCCATCGT CAT GGAGTGCAGCA 696 

560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I 

697 CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 756 

62 0 CAGAT GAC CT CT AT C C CAAGAT CTAC C ACAGTT GCT T CTT T ATT GT C AC CT ACCT GG CCC 679 

I III I I I I I I I I I I I I I I I I I I I II I I I I I I I II II III II I I I 

757 GT GGT GAAATTT AT C C CAAGAT GT ACC ACAT CT GTT T CTT T CT GGT GACAT ACAT GGCAC 816 

680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 

I I I I I I I I I I I I I I I I I III I II I I I I I I I I I I I I I I I I I II I I I I 
817 CACTGTGTCTCATGGTGTTGGCTTATCTGCAAATATTTCGCAAACTCTGGTGTCGACAGA 876 

740 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

877 TCCCTGGAACATCATCTGTAGTTCAGAGAAAATGGAAGCCCC TGCAGCCTGTTT 930 

800 ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 859 

II I Ml I I I I I I I I I I I I I I I I I I I I I 

931 CACAGCCTCGAGGGCCAGGACAGCCAACGAAGTCCCGGATGGGCGCTGTGGCGGCTGAAA 990 

860 TGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTGAT GGT GGT GCT GCT GGT CT 919 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

991 TAAAGCAGATCCGAGCCAGAAGGAAAACAGCCCGGATGTTGATGGTTGTGCTTTTGGTAT 1050 

920 TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 979 
I M I I I M I M II II III I I I I I I I I I I II I I I I I II II I I I I I I I 
1051 T T GCAAT TT GCT AT CTAC CAAT T AGCAT C CTCAAT GT GCTAAAGAGAGT AT TT GGGAT GT 1110 

980 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 
I III I I I I I I I I I I I II I I I I I II I I I II I I I I I II I I I I 

1111 T T GC C CAT ACT GAAGACAGAGAGACT GT GT ATGC CT GGTT T ACCTTT T CACACT GGCT T G 1170 



Qy 1040 T GTAC GC CAACAGC GCT GCCAAC C C CAT CAT C T ACAACTT C CTCAGT GG 1088 

I II I I I I I II I I I I I II II II II II II II I I I I I I II 
Db 1171 TAT AT GC CAAT AGT GCT GCGAAT C CAAT T ATT T ATAATTTT CT CAGT G G 1219 

Search completed: October 15, 2004, 22:55:08 
Job time : 90.0094 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 15, 2004, 19:59:43 ; Search time 572.496 Seconds 

(without alignments) 
9829.265 Million cell updates/se 

US-10-070-532-3 
1110 

1 atggagccctcagccacccc ttccctggagtctgctctaa 1110 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



3340653 seqs, 2534783454 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



6681306 



Database : 



Published_Applications_NA: * 

1: /cgn2_6/ptodata/l/pubpna/US07_PUBCOMB. seq: * 

2 : /cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB. seq: * 

3: /cgn2_6/ptodata/l/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/l/pubpna/US06_PUBCOMB. seq: * 

5 : /cgn2_6/ptodata/l/pubpna/US07_NEW_PUB. seq: * 

6 : / cgn2_6/ptoda ta/ 1 /pubpna/ PCTUS_PUBCOMB .seq:* 

7 : /cgn2_6/ptodata/l/pubpna/US08_NEW_PUB.seq: * 

8 : /cgn2_6/ptodata/l/pubpna/US08_PUBCOMB. seq: * 

9 : /cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB. seq: * 
10: /cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB. seq: * 
11 : /cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB.seq: * 
12 : /cgn2_6/ptodata/l/pubpna/US09_NEW_PUB. seq: * 
13 : /cgn2_6/ptodata/l/pubpna/US09_NEW_PUB . seq2 : * 
14 : /cgn2_6/ptodata/l/pubpna/US10A_PUBCOMB. seq: * 
15 : /cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB . seq: * 
16: /cgn2_6/ptodata/l/pubpna/US10C_PUBCOMB.seq^- 
17 : /cgn2_6/ptodata/l/pubpna/US10_NEW_PUB. seq: * 
18 : /cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. seq: * 
19 : /cgn2_6/ptodata/l/pubpna/US60_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. 


Score 
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Length 


DB 


ID 
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US-09-393-696-3 


Sequence 3, Appli 
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US-10-077-874-3 


Sequence 3, Appli 
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US-10-225-567A-367 


Sequence 367, App 
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Sequence 21, Appl 
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10 


US-09-393-696-1 


Sequence 1, Appli 


6 


1077.8 
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1209 


14 


US-10-077-874-1 


Sequence 1, Appli 
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1077.8 


97 . 


1 


1564 


9 


US-09-828-538-23 


Sequence 23, Appl 


8 


1075.2 


y b . 


y 


1116 


14 


US-10-077-874-5 


Sequence 5, Appli 


9 


1075.2 


9b . 


y 


1133 


10 


US-09-393-.696-5 


Sequence 5, Appli 


10 


1074.6 


96 . 


8 


1278 


11 


US-09-826-509-548 


Sequence 54 8, App 


11 
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0 1 . 


D 
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US-09-730-931-1 


Sequence 1, Appli 
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US-10-278-087A-55 


Sequence 55, Appl 
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4 D . 
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US-10-282-717-1 


Sequence 1, Appli 
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A /I 
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US-10-225-567A-369 


Sequence 369, App 
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11 


US-09-826-509-550 


Sequence 550, App 
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2 1 . 
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/ 


9785 


9 


US-09-961-848-1 


Sequence 1, Appli 


17 


241.2 


21 . 


1 


10453 


9 


US-09-828-538-21 


Sequence 21, Appl 


18 


236 


21 . 


3 


244 


9 


US-09-828-538-6 


Sequence 6, Appli 


19 


223.8 


20 . 


2 


227 


9 


US-09-828-538-10 


Sequence 10, Appl 


20 


199 


17 . 


9 


356 


9 


US-09-828-538-2 


Sequence 2, Appli 


21 


179 


16 . 


1 


179 


9 


US-09-828-538-4 


Sequence 4, Appli 


22 


161.6 


14 . 


6 


1290 


17 


US-10-719-587-55 


Sequence 55, Appl 


23 


161.6 


14 . 


6 


1290 


17 


US-10-719-587-56 


Sequence 56, Appl 


24 


161.6 


14 . 


6 


1293 


9 


US-09-866-248A-7 


Sequence 7, Appli 


25 


161.6 


14 . 


6 


1293 


15 


US-10-225-567A-657 


Sequence 657, App 


26 


161.6 


14 . 


(Z 
D 


1320 


9 


US-09-292-973-3 


Sequence 3, Appli 


27 


154.6 


13 . 


9 


1299 


17 


US-10-719-587-38 


Sequence 38, Appl 


28 


154.6 


13. 


9 


1410 


9 


US-09-866-248A-1 


Sequence 1, Appli 


29 


148.2 


13 . 


4 


1532 


9 


US-09-292-973-1 


Sequence 1, Appli 


30 


126 


11. 


4 


977 


12 


US-09-876-143-1561 


Sequence 1561, Ap 


31 


122 


11. 


0 


122 


9 


US-09-828-538-12 


Sequence 12, Appl 


32 


120.4 


10. 


8 


168575 


15 


US-10-178-194-1 


Sequence 1, Appli 


33 
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10. 


7 


1174 


17 


US-10-611-210-9 


Sequence 9, Appli 


34 


117.4 


10. 


6 


1110 


14 


US-10-044-592-17 


Sequence 17, Appl 


35 


117.4 


10. 


6 


1110 


15 


US-10-278-087A-31 


Sequence 31, Appl 


36 


117.4 


10.6 


1113 


15 


US-10-225-567A-244 


Sequence 244, App 


37 


117.4 


10. 


6 


1152 


17 


US-10-611-210-10 


Sequence 10, Appl 


38 


117.4 


10. 


6 


1174 


17 


US-10-611-210-7 


Sequence 7, Appli 


39 


117.4 


10. 


6 


1331 


14 


US-10-044-592-73 


Sequence 73, Appl 


40 


116.8 


10. 


5 


1173 


17 


US-10-611-210-6 


Sequence 6, Appli 


41 


116 


10. 


5 


116 


9 


US-09-828-538-8 


Sequence 8, Appli 


42 


114.2 


10. 


3 


1174 


17 


US-10-611-210-8 


Sequence 8, Appli 


43 


105.6 


9. 


5 


669 


14 


US-10-044-592-70 


Sequence 70, Appl 


44 


, TO 4 


9. 


4 


436 


13 


US-10-027-632-275272 


Sequence 275272, 


45 


104 


9. 


4 


436 


16 


US-10-027-632-275272 


Sequence 275272, 



ALIGNMENTS 



RESULT 1 
US-09-393-696-3 

; Sequence 3, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 



; APPLICANT: Human Genome Sciences, Inc. et al . 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393, 696 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/462,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 3 

LENGTH: 1110 
; TYPE: DNA 

ORGANISM: Homo sapiens 

FEATURE: 

NAME/KEY: CDS 

LOCATION: (1) . . (1110) 
US-09-393-696-3 



Query Match 100.0%; Score 1110; DB 10; Length 1110; 

Best Local Similarity 100.0%; Pred. No. 9.1e-303; 

Matches 1110; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


l 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


l 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 




Db 


61 


TCCCCTGTGCCTC CAGACT AT GAAGAT GAGTTT CT C C GCTAT CT GT GGCGT GAT TAT CT G 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | | | | | | | | | | | | | | | 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 | 1 1 1 1 1 I I I I I | | | | | I I I | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 I 1 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 II 1 1 1 1 1 1 I 1 1 1 I 1 I 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 ii 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 r 1 1 ii 1 1 1 1 ii 1 1 1 1 1 1 r 1 1 1 1 




Db 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 



1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 81 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 



Qy 


541 


G CAGT C AT GCAAT C CAGC AGT GTGCTGCCT GAGCT AGC CAAC C G CAC AC GGCT CTT CT CA 


600 






1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






1 M 1 M 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 




Db 


601 


CTCT GT CAT GAAC GCT GGGC AGAT GAC CT CT AT C C CAAGAT CT AC C AC AGTT GCT T CTTT 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


C GCGCCTT C CT G G CT GAAGT GAAGC AGAT GC GT GCAC GGAGGAAGACAGC CAAGAT GCTG 


900 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
■ ■ ■ ■ ■ >> ■ >> ■ ■ ■ ■ > ■ i • ■ ■ i t i i i i i i i t i i i i i i i i i i i i i i i i i 




Db 


841 


C GCGCCT T CCT GGCT GAAGT GAAGCAGATGC GT GCAC GGAGGAAGACAGC CAAGAT GCTG 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 

< ■ 1 < 1 > ■ ■ ■ ■ .■ i i i i i i i i i i ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 






1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 


Qy ' 


1021 


ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 


1080 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 
1 1 1 1 ' 1 1 1 i i i 1 i i i i i i i i i i i i i i i i i i i i i i i i i i i I i I I I I l l l I l l l l l l l l l l 1 




Db 


1021 


ACCTTCTCCCACTGGCTG(^TGTAC(^rrAArA<^rnrT(^nrAArnr , r* aTraTrTaraarTTr 


lUoU 


Qy 


1081 


CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 








II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1081 


CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 




RESULT 


2 






us-io- 


077-874- 


-3 





Sequence 3, Application US/10077874 
Publication No. US20020115155A1 
GENERAL INFORMATION : 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES : 12 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 

CITY: Rockville 

STATE: MD 



COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: ■ 

APPLICATION NUMBER: US/10/077,874 
FILING DATE: 20-Feb-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
REGISTRATION NUMBER: 43,975 
REFERENCE/ DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX : 301-309-8439 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1110 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
FEATURE: 

NAME/ KEY: CDS 
LOCATION: 1..1110 
SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
US-10-077-874-3 

Query Match 99.3%; Score 1102; DB 14; Length 1110; 

Best Local Similarity 99.5%; Pred. No. 1.6e-300; 

Matches 1105; Conservative 0; Mismatches 5; Indels 0; Gaps 0 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I | | | | | | | | | | | | I I I 
Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

IN I I I I I I I I I I I II I II I I I I I I I I I 

Db 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I II I I I I I I I I I 

Db 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 



Qy 

Db 



241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 



301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CT GT CAT GAACGCT GGGCAGATGACCT CTAT C CCAAGATCTAC CACAGTTGCTT CTTT 660 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I 
601 GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | I I I I I 
781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

841 CGCGCCTTCCT GGCT GAAGTGAAGCAGAT GCGT GCACGGAGGAAGACAGCCAAGAT GCT G 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 102 0 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

1081 CT CAGT GGCCT T CC CT GGAGT CT GCT CT AA 1110 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1081 CTCAGTGGCCTTCCCTGGAGTCTGCTCTAA 1110 



RESULT 3 . 
US-10-225-567A-367 

; Sequence 367, Application US/10225567A 

; Publication No. US20030113798A1 

; GENERAL INFORMATION: 

; APPLICANT: Lifespan Biosciences 

APPLICANT: Brown, Joseph P. 
; APPLICANT: Burmer, Glenna C. ^ 
; APPLICANT: Roush, Christine L. 

; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED 

RECEPTORS (GPCRS) 

; FILE REFERENCE: 1920-4-4 

; CURRENT APPLICATION NUMBER: US/10/225, 567A 

; CURRENT FILING DATE: .2001-12-19 

; PRIOR APPLICATION NUMBER: 60/257,144 

; PRIOR FILING DATE: 2000-12-19 

; NUMBER OF SEQ ID NOS : 2292 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 367 
; LENGTH: 1564 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-225-567A-367 

Query Match 97.2%; Score 1079.4; DB 15; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 4.2e-294; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0 
Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 




Db 



154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 



Qy 



Db 



61 TC C C CT GT GC CT CCAGACT ATGAAGAT GAGT TT CT CC GCT AT CT GT GGC GT GAT TAT CT G 12 0 
I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 



Qy 



121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 




Db 



274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 



Db 



Qy 



181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I II I 
334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 



Qy 



241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 



Db 



Qy 



301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 



Db 



Qy 

Db 



361 
514 



420 
573 



APPLICANT: Weich, Nadine S. 
APPLICANT: Kelly, Louise M. 

TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR TREATING 

TITLE OF INVENTION: HEMATOLOGICAL DISORDERS USING 131, 148, 199, 12303, 
13906, 

; TITLE OF INVENTION: 15513, 17822, 302, 5677, 194, 14393, 28059, 7366, 12212 
; TITLE OF INVENTION: 1981, 261, 12416, 270, 1410, 137, 1871, 13051, 1847, 
1849, 

TITLE OF INVENTION: 15402, 340, 10217, 837, 1761, 8990 OR 13249 MOLECULES 
FILE REFERENCE: MPI 02-01 9P1RNOMNIM 
CURRENT APPLICATION NUMBER: US/10/352, 684A 
CURRENT FILING DATE: 2003-01-28 
PRIOR APPLICATION NUMBER: US 60/354,333 
PRIOR FILING DATE: 2002-02-04 
PRIOR APPLICATION NUMBER: US 60/360,258 
PRIOR FILING DATE: 2002-02-28 
PRIOR APPLICATION NUMBER: US 60/364,476 
PRIOR FILING DATE: 2002-03-15 
PRIOR APPLICATION NUMBER: US 60/375,626 
PRIOR FILING DATE: 2002-04-26 
PRIOR APPLICATION NUMBER: US 60/386,494 
PRIOR FILING DATE: 2002-06-06 
PRIOR APPLICATION NUMBER: US 60/390,965 
PRIOR FILING DATE: 2002-06-24 
PRIOR APPLICATION NUMBER: US 60/392,480 
PRIOR FILING DATE: 2002-06-28 
PRIOR APPLICATION NUMBER: US 60/394,128 
PRIOR FILING DATE: 2002-07-03 
PRIOR APPLICATION NUMBER: US 60/399,783 
PRIOR FILING DATE: 2002-07-31 
PRIOR APPLICATION NUMBER: US 60/403,221 
PRIOR FILING DATE: 2002-08-13 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS:_ 62 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 21 
LENGTH: 1564 
TYPE: DNA 

ORGANISM: Homo Sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (154) . . . (1431) 
US-10-352-684A-21 



Query Match 91.2%; Score 1079.4; DB 16; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 4.2e-294; 

Matches 1083; Conservative 0; Mismatches 6; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 154 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 273 



Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 



I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 



Db 


274 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


333 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






M M 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I | I | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


334 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


393 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


453 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


513 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


514 


GTCAT C C CCTAT CT ACAGG CT GTGT C C GTGT CAGT GGCAGT GCT AACT CT CAGCT T CAT C 


573 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


574 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


633 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


634 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


693 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


694 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


753 


Qy 


601 


CT CT GT CAT GAAC G CT GGGC AGAT GAC CT CT AT C C CAAGAT CT ACC AC AGTT GCT T CTTT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


754 


GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


813 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I | 




Db 


814 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


873 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | 




Db 


874 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


933 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


934 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


993 


Qy 


841 


CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 


900 






1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II | | || | | | | | | | | | 




Db 


994 


CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 


1053 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1054 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


1113 


Qy 


961 


AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 


1020 



1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

I I I I I M I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I II I I I I I 
Db 1234 CTCAGTGGC 1242 



RESULT 5 
US-09-393-696-1 

; Sequence 1, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 

APPLICANT: Human Genome Sciences, Inc. et al . 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393,696 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/ 05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/462,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 1 

LENGTH: 1209 

TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE : 

NAME/ KEY: CDS 

LOCATION: (1)..(1209) 
US-09-393-696-1 



Query Match 97.1%; Score 1077.8; DB 10; Length 1209; 

Best Local Similarity 99.4%; Pred. No. l.le-293; 

Matches 1082; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 



Qy 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 




Db 


i 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 IT 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCCCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | | | | | M | 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCT-GCCTG 300 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCT7UVCTCTCAGCTTCATC 420 

421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 480 

481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

M I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

601 CT CT GT CAT GAAC GCT GG G CAGAT GAC CT CTAT C CCAAGAT CT ACC ACAGTT GCT T CTTT 660 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 GTCT GT CAT GAAC GCT GGGCAGAT GAC CT CTAT CCCAAGAT CTACCACAGTTGCTTCTTT 660 

661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 78 0 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
721 AACCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 84 0 

841 CGCGCCTT CCTGGCTGAAGT GAAGCAGAT GCGT GCACGGAGGAAGACAG CCAAGAT GCTG 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 900 

901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCC7^ACCCCATCATCTACAACTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 
1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

1081 CTCAGTGGC 1089 
I I I I I I I I I 



Db 1081 CTCAGTGGC 1089 



RESULT 6 
US-10-077-874-1 

; Sequence 1, Application US/10077874 
; Publication No. US20020115155A1 
; GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 

TITLE OF INVENTION: Human Neuropeptide Receptor 
NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Human Genome Sciences, Inc. 

STREET: 9410 Key West Avenue 
CITY: Rockville 
; STATE: MD 

COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/077,874 
FILING DATE: 20-Feb-2002 
; CLASSIFICATION: <Unknown> 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 
FILING DATE: 05-JUNE-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Wales, Michele M. 
; REGISTRATION NUMBER: 43,975 

REFERENCE/ DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1209 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
FEATURE: 

; NAME/ KEY: CDS , 

LOCATION: 1..1209 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-10-077-874-1 

Query Match 97.1%; Score 1077.8; DB 14; Length 1209; 

Best Local Similarity 99.4%; Pred. No. l.le-293; 

Matches 1082; Conservative 0; Mismatches 7; Indels 0; Gaps 0 

QY 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db. 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 



TCCCCTGTGCCTC C AGACT AT GAAGAT GAGTT T CT C C GCTAT CT GT GG C GT GAT TAT CT G 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 12 0 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 18 0 

CT GGT GGGCAACAC GCT GGT CT GC CT GGCCGT GT GGC G GAAC CAC CAC AT GAGGAC AGTC 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 24 0 

ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 42 0 
I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 8 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I II I 
GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 8 0 

GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 54 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I 
GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

I I M I I I I I III I I I I I I I I 1 I 1 I I I I I I I I I I I I I | | | | | | 

GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 600 

CTCT GTCAT GAAC GCT GGGCAGAT GAC CT CTAT C C CAAGAT CTAC CACAGT TGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

7\AGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGG7\ACTGGAAGCGC 7 8 0';. 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M | | | | 
CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGCC 840 

CGC GCCTT C CT GGCT GAAGT GAAGCAGATGCGT GCACGGAGGAAGACAGCCAAGAT GCT G 900 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I | | | | | | I | 
CGCGCCTT CCTGGCTGAAGT GAAGCAGAT GCGT GCACGGAGGAAGACAGCCAAGATGCT G 900 



Qy 

Db 



901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

M I I I I I I I I I I I I I I I I I i I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 



Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I II I I II I I I 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 AC C T T CT C C C ACT GGC T G GT GT AC GC C AAC AGC G CT GC CAAC C C CAT CAT CT ACAACTT C 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I II 

Db 1081 CTCAGTGGC 1089 



RESULT 7 

US-09-828-538-23 

Sequence 23, Application US/09828538 
Patent No. US20010025031A1 
GENERAL INFORMATION: 
APPLICANT: Ellis, Catherine E. 
APPLICANT: Kwok, Cheni 
APPLICANT: Bodsworth, Nicola J. 
APPLICANT: Halsey, Wendy 
APPLICANT: Van Horn, Stephanie 

TITLE OF INVENTION: HFGAN72 Receptor Genomic DNA and Methods 
TITLE OF INVENTION: of Use Thereof in Diagnostic Applications 
FILE REFERENCE: GH-50038-C1 
CURRENT APPLICATION NUMBER: US/09/828 , 538 
CURRENT FILING DATE: 2001-04-06 
PRIOR APPLICATION NUMBER: 60/088,624 
PRIOR FILING DATE: 1998-06-08 
PRIOR APPLICATION NUMBER: 60/093,726 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: 09/328,014 
PRIOR FILING DATE: 1999-06-08 
NUMBER OF SEQ ID NOS : 24 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 23 
LENGTH: 1564 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-828-538-23 

Query Match 97.1%; Score 1077.8; DB 9; Length 1564; 

Best Local Similarity 99.4%; Pred. No. 1.2e~293; 

Matches 1082; Conservative 0; Mismatches 7; Indels 0; Gaps 0; 

Qy i atggagccctcagccaccccaggggcccagatg'ggggtcccccctggcagcagagagccg 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

D b 154 ATGGAGCCCTCAGCCACCCCAGGGG'CCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 213 

Qy 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 120 

I I I I M I I I I I I I I I I I I I I I I I M I I I I I | | | | | | II I I I I I I I I I I 

Db 214 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGCGATTATCTG 273 



Qy 121 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 274 TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 333 

Qy 181 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 240 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 334 CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 393 

Qy 241 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 394 ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 453 

Qy 301 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 360 

i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | . 
Db 454 CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 513 

Qy 361 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 514 GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 573 

Qy 421 CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 4 80 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | M | | | | | M I I I I I I I I 
Db 574 GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 633 

Qy 481 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I 
Db 634 GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 693 

Qy 541 GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCC7UVCCGCACACGGCTCTTCTCA 600 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | 
Db 694 GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 753 

Qy 601 CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 754 GT CT GT GAT GAAC GCT GGGCAGAT GAC CT CT AT CC CAAGAT CT AC CACAGT T GCTT CTTT 813 

Qy 661 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 814 ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 873 

Qy 721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | 
Db 874 AAGCT C T GGGG C C GC C AGAT C C C CGG C ACCAC CTCAGCACT GGT GC G GAACT G GAAGC GC 933 

Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGG^b 840 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 C C CT CAGACCAGCTGGGGGAC CT GGAGC AGGG C CT GAGT GGAGAGC C C CAGC C CC G GGGC 993 

Qy 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT GC GT GCAC G GAGGAAGAC AGCCAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 994 CGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTG 1053 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1054 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 1113 



Qy 

Db 



961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I M I I I I I M I I I I I I I I I I I I I ! I I I I I I | I I M | I I | | | I | | | | | | | | | | | | | 

1114 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1173 



Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I 

Db 1174 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1233 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 
- Db 1234 CTCAGTGGC 1242 



RESULT 8' 
US-10-077-874-5 

; Sequence 5, Application US/10077874 
; Publication No. US20020115155A1 
GENERAL INFORMATION: 

APPLICANT: Soppet, Daniel et al 
; TITLE OF INVENTION: Human Neuropeptide Receptor 

NUMBER OF SEQUENCES: 12 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockville 
; STATE: MD 

COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; ^ SOFTWARE: Patentln Release #1.0, Version #1.30 

; ' CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 10/ 077 , 874 

FILING DATE: 20-Feb-2002 

CLASSIFICATION: <Unknown> 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/462,509 

FILING DATE: 05-JUNE-1995 
; ATTORNEY/ AGENT INFORMATION: 

NAME: Wales, Michele M. 

REGISTRATION NUMBER: 43,975 

REFERENCE/ DOCKET NUMBER: PF168P1D1 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 301-S09-8504 

TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1116 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: DNA (genomic) 
; * FEATURE : 

NAME/KEY: CDS 

LOCATION: 1..1116 



SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-10-077-874-5 



Query Match 96.9%; Score 1075.2; DB 14; Length 1116; 

Best Local Similarity 99.3%; Pred. No. 5.9e-293; 

Matches 1080; Conservative 0; Mismatches 8; Indels 0; Gaps 0; 



Qy 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 


60 






1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 




Db 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGACCCC 


60 


Qy 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 






1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 II 1 1 1 1 1 1 1 1 1 1 
■ i ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i j i i i i i i i i i i i i i i i i i i | | | | 




Db 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGATTATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 M 1 M 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 I 1 I 1 I I I I I I I I 1 1 I 1 I I 1 1 1 1 I 1 I I 1 I 
■ • ■ ■ ■ • ■ ■ iii t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i j i i i i i i i [ j 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I M I I I I I I | I I 1 I 1 | 1 1 1 1 1 1 1 1 1 1 | 1 1 
• ■ * < ■ ■ < ■ ■ ■ ■> ■ ■ ■ ■ * ■ ■ ■ ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 i 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


-/ V VJ 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
■ i i i i i * i i i i i i i i i » i i i i i i i i i i i i i i \ i i i i i i i i i i i i i i i i i i i \ \ i i i i i i | 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
■ * <■ i ■ ■ * * ■ i • i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 | | | I I I I I M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 
* * 1 1 ■ 1 1 1 1 1 • 1 1 >' i * i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i j i i j i i i i 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CT CT GT CAT GAAC G CT GGGCAGATGAC CT CT AT C C CAAGATCT AC CAC AGT TGCTTCTTT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


601 


GT CT GT GAT GAAC GCT GGGC AGAT GAC CT CT AT C CCAAGAT CT ACC ACAGTT GCT T CTT T 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 



Qy 



721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | 



Db 



721 AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 780 



Qy 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 840 

Qy 841 C GCGC CT TCCT GGCT GAAGT GAAGC AGAT GC GT GCAC GGAGGAAGACAGCCAAGAT GCT G 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I 
Db 841 CGCGCCTTCCTGGCT GAAGT GAAGCAGAT G C GT GCAC GGAGGAAGACAGC CAAGAT G CT G 900 

Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | 
Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTAC7UVCTTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACT GGCT GGTGTACGCCAACAGCGCTGCCAACCCCAT CAT CTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I I I I I I I 

Db 1081 CTCAGTGG 1088 



RESULT 9 
US-09-393-696-5 

; Sequence 5, Application US/09393696 
; Publication No. US20030022277A1 
; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences, Inc. et al . 
; TITLE OF INVENTION: Human Neuropeptide Receptor 
; FILE REFERENCE: PF168P2 

; CURRENT APPLICATION NUMBER: US/09/393,696 

; CURRENT FILING DATE: 1999-09-10 

; EARLIER APPLICATION NUMBER: PCT/US95/05616 

; EARLIER FILING DATE: 1995-05-05 

; EARLIER APPLICATION NUMBER: US08/462,509 

; EARLIER FILING DATE: 1995-06-05 

; NUMBER OF SEQ ID NOS : 23 

; SOFTWARE: Patent In Ver. 2.0 

; SEQ ID NO 5 

LENGTH: 1133 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (1) . . (1131) 
US-09-393-696-5 

Query Match 96.9%; Score 1075.2; DB 10; Length 1133; 

Best Local Similarity 99.3%; Pred. No. 6e-293; 

Matches 1080; Conservative 0; Mismatches 8; Indels 0; Gaps 0; 



Qy 



1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 



Db 


1 


ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGACCCC 


60 


Qy 


61 


T C C C CT GT GCCT C C AGACT AT GAAGAT GAGT T T CT C C G CT AT CT GT GGC GT GAT T AT CT G 


120 






II 1 1 1 II 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCCGCTATCTGTGGCGTGAT-TATCTG 


120 


Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 






1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 U 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 M 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 




Db 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 I I I I | | 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


GTCTGTGATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTT 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


, 661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 | | | | | | | 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 1 1 1 1 1 1 1 1 1 I I I I I I 




Db 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


C GC GCCTT C CT GG CT GAAGT GAAGCAGAT G C GT GCAC GGAGGAAGAC AGC CAAGAT G CT G 


900 



1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



841 CGCGCCTTCCTGGCT GAAGT GAAG CAGAT GC GT GCAC GGAGGAAGAC AGC CAAGAT GCT G 900 



Qy 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 960 

I I I M I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | I I I I I I I I I I I I I I I || 

Db 901 ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 960 

QY 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 

Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGG 1088 

I I II II I I 
Db 1081 CTCAGTGG 1088 



RESULT 10 
US-09-826-509-548 

; Sequence 548, Application US/09826509 
; Publication No. US20030204073A1 
; GENERAL INFORMATION: 

• ; APPLICANT: Lehmann-Bruinsma, Karin 
; APPLICANT: Liaw, Chen W. 
; APPLICANT: Lin, I-Lin 

; TITLE OF INVENTION: No. US20030204073A1-Endogenous, Constitutively Activated 
Known G 

; TITLE OF INVENTION: Protein-Coupled Receptors 
; FILE REFERENCE: AREN-207 

; CURRENT APPLICATION NUMBER: US/ 09/826, 509 

; CURRENT FILING DATE: 2001-04-05 

; PRIOR APPLICATION NUMBER: 60/195,747 

; PRIOR FILING DATE: 2000-04-07 

; PRIOR APPLICATION NUMBER: 09/170,496 

; PRIOR FILING DATE: 1998-10-13 

; NUMBER OF SEQ ID NOS: 589 

; SOFTWARE: Patentln Version 2.1 

; SEQ ID NO 548 

LENGTH: 1278 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-826-509-548 

Query Match 96.8%; Score 1074.6; DB 11; Length 1278.; — 

Best Local Similarity 99.2%; Pred. No. 9e-293; 

Matches 1080; Conservative 0; Mismatches 9; Indels 0; Gaps 0; 

Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | I I I I I I I I I I I I I I I I I | | | | | | | | | M 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

Qy 61 T C CCCT GT GCCT C CAGACTAT GAAGAT GAGT TT CT C C GCTAT CT GT G GC GT GATT AT CT G 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | I I | | | | | | | | || | | 
Db 61 TCCCCTGTGCCTCCAGACTATGAAGATGAGTTTCTCC GCTAT CTGTGGCGTGATTATCTG 120 



Qy 


121 


TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


UD 


Izl 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 

TACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCC 


180 


Qy 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 






1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | || | | | | | | | | M | | | | | | | | | | | | | | | | 




Db 


181 


CTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTC 


240 


Qy 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 




T-\1_ 
DD 


241 


ACCAACTACTTCATTGTCAACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTG 


300 


Qy 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 






1 II 1 1 II 1 M 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I I I I | || | | | | | | | | | | | || || || | | | | | | || 




Db 


301 


CCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAG 


360 


Qy 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 






1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


GTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATC 


420 


Qy 


421 


CCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


GCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGG 


480 


Qy 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


481 


GCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCT 


540 


Qy 


541 


GCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 






1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


GCAGTCATGGAATGCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCA 


600 


Qy 


601 


CT CT GT CAT GAAC GCT GGGC AGAT GAC CT CT AT CCCAAGAT CTAC C AC AGTT GCTT CTT T 


660 






Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


GT CT GTGAT GAAC GCT GGGCAGATGAC CT CT AT C CCAAGAT CTAC C ACAGTTGCT T CT T T 


660 


Qy 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | || | | | | || 




Db 


661 


ATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGC 


720 


Qy 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 | | | | 




Db 


721 


AAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGC 


780 


Qy 


781 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 






M 1 1 1 1 1 1 1 1 1 1 MM M II 1 1 II 1 1 1 1 1 1 1 1 II 1 II II 1 1 II | || || | | || || | || | | | 






TOT 

/ol 


CCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGC 


840 


Qy 


841 


C GC GCCTTCCT GG CT GAAGT GAAGCAGAT GC GT GCAC GGAGGAAGACAGC CAAGAT GCT G 


900 






1 M 1 1 1 1 1 1 1 1 1 M 1 II 1 II 1 1 II 1 II II 1 1 1 II 1 1 1 1 I I | | | | | | | | I I I I I | | | | 




Db 


841 


C G C GCCTT C CT GGCTGAAGT GAAGCAGAT GC GT GCAC GGAGGAAGACAAAAAAGAT GCT G 


900 


Qy 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTT 


960 






1 M M M II 1 1 II 1 II 1 1 1 1 1 1 Illlll III II HUM 1 1 | MM 1 III 1 1 




Db 


901 


ATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTGCCCATCAGCGTCCTCAATGTCCTT 


960 



Qy 961 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 96-1 AAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTC 1020 



Qy 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 108 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 ACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTC 1080 

Qy 1081 CTCAGTGGC 1089 

I I I I I I I I I 

Db 1081 CTCAGTGGC 1089 



RESULT 11 
US-09-730-931-1 

; Sequence 1, Application US/09730931 

; Patent No. US20020064814A1 

; GENERAL INFORMATION: 

; APPLICANT: ELLIS, CATHERINE E. 

; TITLE OF INVENTION: DOG OREXIN 1 RECEPTOR 

FILE REFERENCE: GH-70669 
; CURRENT APPLICATION NUMBER: US/09/730, 931 
; CURRENT FILING DATE: 2000-12-06 
; PRIOR APPLICATION NUMBER: 60/169,373 
; PRIOR FILING DATE: 1999-12-07 
; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 1 
; LENGTH: 1281 
TYPE: DNA 

ORGANISM: CANIS FAMILIARIS 
US-09-730-931-1 

Query Match 81.5%; Score 905; DB 9; Length 1281; 

Best Local Similarity 89.9%; Pred. No. 6.2e-245; 

Matches 984; Conservative 0; Mismatches 105; Indels 6; Gaps 1; 



Qy 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGATGGGGGTCCCCCCTGGCAGCAGAGAGCCG 60 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I III I I I I I III II I I I I I I 
Db 1 ATGGAGCCCTCAGCCACCCCAGGGGCCCAGACTGGGACCCCCACCGGCGGCGGGGAGCTG 60 

Qy 61 TCCCCT— GT GC CT CCAGACT AT GAAGAT GAGTTT CT C C GCT AT CTGT GGC GT GAT 114 

M M I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I II I 

Db 61 TCTCCGTCACTGGTGCCTCCCGACTATGAAGACGAGTTCCTGCGCTATCTGTGGCGCGAT 120 

Qy 115 TATCTGTACCCAAAACAGTATGAGTGGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTC 174 

M I I I I I I I I I I I MINIMI 1-1 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 121 TACCTGTACCCAAAGCAGTATGAGTGGGTCCTCATCGCTGCCTACGTGGCTGTGTTCCTA 180 

Qy 175 GTGGCCCTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGCGGAACCACCACATGAGG 234 

M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II II I 

Db 181 GTGGCCCTGGTGGGCAACACGCTGGTCTGCCTGGCCGTGTGGAGGAACCACCACATGAGG 240 

Qy 235 ACAGT C ACCAACT ACT T CATTGTCAAC CT GT C C CT GGCT GAC GT T CT GGT GACT GCT AT C 294 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I II Ml 

Db 241 ACGGTCACCAACTATTTCATTGTCAACCTGTCCCTGGCTGATGTGCTGGTGACAGCCATC 300 

Qy 295 TGCCTGCCGGCCAGCCTGCTGGTGGACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTC 354 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 III II I II 

301 TGCCTCCCGGCCAGCCTGCTGGTAGACATCACTGAGTCCTGGCTCTTCGGTCATACCCTC 360 

355 TGCAAGGTCATCCCCTATCTACAGGCTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGC 414 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I II II 
361 T GCAAAGT CAT C C C CT ACCT AC AGGC C GT GT CT GTGT C GGT GG C AGT GCT GACT CT CAGC 420 

415 TTCATCCCCCTGGACCGCTGGTATGCCATCTGCCACCCACTATTGTTCAAGAGCACAGCC 474 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I II I I I I I I I III ■ 
421 TTCATCGCCCTGGACCGCTGGTATGCCATCTGCCACCCGCTGTTGTTCAAGAGCACCGCC 480 

475 CGGCGGGCCCGTGGCTCCATCCTGGGCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCC 534 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
481 CGGCGCGCCCGCAGCTCCATCCTGGGCATCTGGGCTGTGTCATTGGCTGTCATGGTACCT 540 

535 CAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTC 594 

I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III 
541 CAGGCTGCCGTCATGGAATGCAGCAGCGTGCTCCCTGAGCTAGCCAACCGCACCCGCCTC 600 

595 TTCTCACTCTGTCATGAACGCTGGGCAGATGACCTCTATCCCAAGATCTACCACAGTTGC 654 

I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
601 T TCTCT GT GT GT GAT GAACACT GGGCAGAT GAC CT CT AT C C CAAGAT CTAC C ACAGTT GC 660 

655 TTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATA 714 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I II I I II I I I I 
661 TTCTTCATTGTCACCTACTTGGCCCCACTGGGCCTCATGGCCATGGCCTATTTCCAGATC 720 

715 TTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACCACCTCAGCACTGGTGCGGAACTGG 774 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I M I I I I I II I 
721 TTCCGCAAGCTCTGGGGCCGCCAGATCCCTGGCACCACATCGGCCCTGGTGAGGAACTGG 780 

775 AAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCC 834 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II 
781 AAGCGGCCCTCGGACCAGCTGGAGGACCAGGGGCCCGGCCTGAGCGCGGAACCCCCCCCT 840 

835 CGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAG 894 

I I I I III I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
841 CGGGCCCGGGCCTTCCTGGCTGAGGTGAAGCAGATGCGAGCGCGGAGGAAGACGGCCAAG 900 

895 ATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAAT 954 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
901 ATGCTGATGGTGCTGCTGCTGGTCTTTGCCCTCTGCTACCTGCCCATCAGTGTCCTCAAT 960 

955 GTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCC 1014 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
961 GT C C T CAAGAG GGT GT T C GGGATGT T C C GC CAAT CCAGT GAC C GAGAAGCCGT GT AC GC C 1020 

1015 TGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAACAGCGCTGCCAACCCCATCATCTAC 1074 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 TGCTTCACCTTCTCCCACTGGCTGGTGTATGCCAACAGCGCTGCCAACCCCATCATCTAC 1080 

1075 AACTTCCTCAGTGGC 108 9 

I I I I I I I I I I I III 
1081 AACTTCCTCAGCGGC 1095 



US-10-278-087A-55 

; Sequence 55, Application US/10278087A 
; Publication No. US20030138817A1 
GENERAL INFORMATION: 

APPLICANT: Shu j i Hinuma 
; Yasuaki Ito 

; Ryo Fujii 

TITLE OF INVENTION: G Protein Coupled Receptor Protein, 
; Production, And Use Thereof 

; NUMBER OF SEQUENCES: 61 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Edwards & Angell, LLP 

STREET: 101 Federal Street 

CITY: BOSTON 

STATE: MA 

COUNTRY: USA 

ZIP: 02209 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/10/278, 087A 

FILING DATE: 31-Jan-2003 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/461,436 

FILING DATE: 14-DEC-1999 
; APPLICATION NUMBER: 09/038,572 

FILING DATE: ll-MAR-1998 
; APPLICATION NUMBER: 08/513,974 

FILING DATE: 14-SEP-1995 

APPLICATION NUMBER: PCT/ JP95/01599 

FILING DATE: 10-AUG-1995 

APPLICATION NUMBER: 7-093989 
; FILING DATE: 19-APR-1995 

APPLICATION NUMBER: 7-057186 

FILING DATE: 16-MAR-1995 

APPLICATION NUMBER: 7-007177 

FILING DATE: 20-JAN-1995 

APPLICATION NUMBER: 6-326611 

FILING DATE: 28-DEC-1994 
; APPLICATION NUMBER: 6-270017 

; FILING DATE: 02-NOV-1994 

- ^ APPLICATION NUMBER: 6-236357 

FILING DATE: 30-SEP-1994 

APPLICATION NUMBER: 6-236356 

FILING DATE: 30-SEP-1994 

APPLICATION NUMBER: 6-189274 
; FILING DATE: ll-AUG-1994 

APPLICATION NUMBER: 6-189273 

FILING DATE: ll-AUG-1994 

APPLICATION NUMBER: 6-189272 

FILING DATE: ll-AUG-1994 
ATTORNEY/AGENT INFORMATION: 

NAME: CONLIN, DAVID G. 



; REGISTRATION NUMBER: <Unknown> 

; ' REFERENCE/ DOCKET NUMBER: 45753 DIV3 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 617-439-4444 
; TELEFAX: 617-439-4170 

INFORMATION FOR SEQ ID NO: 55: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 789 base pairs 
TYPE: nucleic acid 
; STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
; SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

US-10-278-087A-55 



Query Match 59.5%; Score 661; DB 15; Length 789; 

Best Local Similarity 89.9%; Pred. No. 3.6e~176; 

Matches 709; Conservative 0; Mismatches 80; Indels 0; Gaps 0 



Qy 


271 


GCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTG^TG^ArATfArT^Ar; 


o o w 






II II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
11 i i ii i i i i I I I I it I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 II 




Db 


1 


GCCGATGTGCTGGTGACAGCCATCTGCCTGCCGGCCAGTCTGCTGGTAGACATCACGGAA 


60 


Qy 


331 


TCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCTGTGTCCGTG 


o zj \J 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
' ' 11 ■■■ ■ ■ ■ i i ■ 1 I i i 1 i I i i i i i i i i i i i i i I i i 1 i i i I i i i i i i I i l l 




Db 


61 


TCCTGGCTCTTTGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGGCCGTGTCCGTG 


120 


Qy 


391 


TCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATGCCATCTGCCAC 


450 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1- 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 
i i i i i i i i i i j i i i i i j i i i i i i i i i i i i i i i i i i i i i i I i i i i i i i 




Db 


121 


TCAGTGGTCGTGCTGACTCTCAGCTCCATCGCCCTGGACCGCTGGTACGCCATCTGCCAC 


180 


Qy 


451 


CCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGGGCATCTGGGCT 


510 






II ! 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | | 1 1 1 1 1 1 1 | | | | 1 1 1 1 1 1 I | | | | 




Db 


181 


CCGCTGTTGTTCAAGAGCACTGCCCGGCGCGCCCGCGGCTCCATCCTCGGCATCTGGGCG 


240 


Qy 


511 


GTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCAGTGTGCTGCCT 


570 






1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


241 


GTGTCGCTGGCTGTCATGGTGCCTCAGGCTGCTGTCATGGAGTGTAGCAGCGTGCTGCCC 


300 


Qy 


571 


GAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGGCAGATGACCTC 


630 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 II 1 1 1 1 1 11.11 I 1 1 1 I I I I I I I I I I | | 




Db 


301 


GAGCTGGCCAACCGCACCCGCCTCCTGTCTGTCTGTGATGAGCGCTGGGCAGACGACCTG 


360 


Qy 


631 


TATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCCCACTGGGCCTC 


690 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M 1 1 1 1 1 1 




Db 


361 


TACCCCAAGATCTACCACAGCTGCTTCTTCATTGTCACCTACCTGGCCCCACTGGGCCTC 


420 


Qy 


691 


ATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 


750 






1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ATGGCCATGGCCTATTTCCAGATCTTCCGCAAGCTCTGGGGCCGCCAGATCCCCGGCACC 


480 


Qy 


751 


ACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGGACCTGGAGCAG 


810 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 




Db 


481 


ACCTCGGCCCTGGTGCGCAACTGGAAGCGGCCCTCAGACCAGCTGGACGACCAGGGCCAG 


540 



Qy 



811 GGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAGTGAAGCAGATG 87 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I | MUM 



Db 



541 GGCCTGAGCTCAGAGCCCCAGCCCCGGGCCCGCGCCTTCCTGGCCGAGGTGAAACAGATG 600 



Qy 871 CGTGCACGGAGGAAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 930 

M M II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 CGAGCCCGGAGGAAGACGGCCAAGATGCTGATGGTGGTGCTGCTGGTCTTCGCCCTCTGC 660 

Qy 931 TACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGTTCCGCCAAGCC 990 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 TACCTGCCCATCAGTGTCCTCAACGTCCTCAAGAGGGTCTTCGGGATGTTCCGCCAAGCC 72 0 

Qy 991 AGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 1050 

II I I I I I II II I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 AGCGACCGAGAGGCCATCTACGCCTGCTTCACCTTCTCCCACTGGCTGGTGTACGCCAAC 780 

Qy 1051 AGCGCTGCC 1059 

I I I I I III 

Db 781 AGCGCCGCC 789 



RESULT 13 
US-10-282-717-1 

; Sequence 1, Application US/10282717 
; Publication No. US20030083466A1 
; GENERAL INFORMATION: 
; APPLICANT: YANAGISAWA, MASASHI 

; TITLE OF INVENTION: cDNA CLONE MY1 THAT ENCODES A NOVEL 

; TITLE OF INVENTION: HUMAN 7 -TRANSMEMBRANE RECEPTOR 

; FILE REFERENCE: GH50029D1C1 

; CURRENT APPLICATION NUMBER: US/10/282,717 

; CURRENT FILING DATE: 2002-10-28 

PRIOR APPLICATION NUMBER: 09/676,625 
; PRIOR FILING DATE: 2000-10-02 
; PRIOR APPLICATION NUMBER: 09/119,788 
; PRIOR FILING DATE: 1998-07-21 
; PRIOR APPLICATION NUMBER: 60/053,790 

PRIOR FILING DATE: 1997-07-25 
; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 1 

LENGTH: 1633 

TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-10-282-717-1 

Query Match 46.1%; Score 511.8; DB 15; Length 1633; 

Best Local Similarity .70.0%; Pred. No. 5.2e-134; 

Matches 706; Conservative 0; Mismatches 297; Indels 6; Gaps 1; 



Qy 8 0 AT GAAGAT GAGTTT CT C C GCT AT CT GT GGC GT GATT AT CT GT AC C CAAAACAGT AT GAGT 139 

I M M M M M II II I I I I I I I II II III I II I III I I I I I I I I 

Db 217 ACGACGAGGAATTCCTGCGGTACCTGTGGAGGGAATACCTGCACCCGAAAGAATATGAGT 276 

Qy 14 0 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I II I I I I II I I I I I I I II I I I II I I I I I I I II I II I 

Db 277 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 336 

Qy 2 00 TCTGCCTGGCCGTGTGGC GGAACCAC C ACAT GAGGACAGT CAC CAACT ACT T CAT T GT CA 259 



I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 

337 TTT GT GT GGCAGT GT GGAAGAAC CAC C ACAT GAG GAC GGT AAC CAACTACT T C AT AGT CA 396 



260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

397 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 45 6 

320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I I I I I II II II II I I I I I I I II II II I I I I I I I I I 
4 57 ATATCACTGAGACCTGGTTTTTTGGACAGTCCCTTTGCAAAGTGATTCCTTATCTACAGA 516 

38 0 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I IN I II II II II I I I I III II I I I | II I I I I I I I 
517 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 576 

440 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 4 99 

I I I I I I II I I I I I I I I I I I I I I I I I I I III Ill I 

577 CAATCTGTCACCCTTTGATGTTTAAGAGCACAGCAAAGCGGGCCCGTAACAGCATTGTCA 636 

500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCTVATCCAGCA 559 

I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I 

637 T CAT CT G GAT T GTCT C CT G C ATT ATAAT GATTCCT CAGGCCAT C GT CAT GGAGT GCAGCA 696 

560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

I I I I I I I I II I I I I I I I III I I I I I I I I I I I 

697 CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 756 

62 0 CAGAT GAC CT CT AT C C CAAGAT CT AC CACAGTT GCT T CT TTATT GT C ACCT AC CT GGC CC 679 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

757 GTGGTGAAAT TT AT CC CAAGAT GT AC CACAT CT GT TT CTTTCT GGT GACAT ACAT GGC AC 816 

68 0 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 

I I I I I I I I I I I II I I I I III I II I I II I Mill I I I I I I I II I I I I 
817 CACTGTGTCTCATGGTGTTGGCTTATCTGCAAATATTTCGCAAACTCTGGTGTCGACAGA 87 6 

740 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

877 T C C C T G G AAC AT CAT CTGTAGTT C AG AG AAAAT G G AAG C C C C TGCAGCCTGTTT 930 

8 00 ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 859 

II I M I I I I I I I I I I I I I I I I I II I I I 

931 CACAGCCTCGAGGGCCAGGACAGCCAACGAAGTCCCGGATGGGCGCTGTGGCGGCTGAAA 990 

860 TGAAGCAGATGCGTGCACGGAGGAAGACAGCCAAGATGCTGATGGTGGTGCTGCTGGTCT 919 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
991 TAAAGC AGAT C C GAGC C AGAAGGAAAAC AGCCC GGAT GT T GAT GGT T GT GCTTTT GGT AT 1050 

92 0 TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 979 
I M I I I I I I II II II III I I I I I I I I I I II II I I I II II I I I I I I I 
1051 T T GCAAT T T GCT AT CT AC CAATT AG CAT C CT CAAT GT GCT AAAGAGAGT ATTTGGGAT GT 1110 

980 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 
I III I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1111 i TT GCCCAT ACT GAAGAC AGAGAGACT GT GT ATGCCT GGT TTAC CTT T T CACACT GGCT T G 1170 

104 0 TGTACGCCAACAGCGCTGCCAACCCCATCATCTACAACTTCCTCAGTGG 1088 
I II I I I I I II I I I II II II II II II II II I I I I II I I 



Db 



1171 T AT AT GC CAAT AGT GCT GC GAAT C CAATT AT T TAT AATT T T CT C AGT G G 1219 



RESULT 14 

US-10-225-567A-369 

; Sequence 369, Application US/10225567A 

; Publication No. US20030113798A1 

; GENERAL INFORMATION: 

; APPLICANT: Lifespan Biosciences 

; APPLICANT: Brown, Joseph P. 

APPLICANT: Burmer, Glenna C. 
; APPLICANT: Roush, Christine L. 

; TITLE OF INVENTION: ANTIGENIC PEPTIDES AND ANTIBODIES FOR G PROTEIN-COUPLED 

RECEPTORS (GPCRS) 

; FILE REFERENCE: 1920-4-4 

; CURRENT APPLICATION NUMBER: US/10/225, 5 67 A 
; CURRENT FILING DATE: 2001-12-19 
; PRIOR APPLICATION NUMBER: 60/257,144 
; PRIOR "FILING DATE: 2000-12-19 
; NUMBER OF SEQ ID NOS : 2292 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 369 
; LENGTH: 1843 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-225-567A-369 

Query Match 46.1%; Score 511.8; DB 15; Length 1843; 

Best Local Similarity 70.0%; Pred. No. 5.3e-134; 

Matches 706; Conservative 0; Mismatches 297; Indels 6; Gaps 1 

Qy 80 AT GAAGAT GAGTT T CT C CG CT AT CT GT GGCGT GATT AT CT GT AC C CAAAAC AGTAT GAGT 139 

I II II II II II II II I I I I I I I II II II I I I I I III I I I I I I I I 
Db 428 AC GAC GAGGAATT C CT GCGGT ACCT GT GGAGGGAAT AC CT GCAC C C GAAAGAATAT GAGT 4 87 

Qy 140 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 88 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 547 

Qy 2 00 TCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTCACCAACTACTTCATTGTCA 259 

I M I I I I I I I I I I I I I I I I I I I Ml I I I I I I II I I I I I I I I I I I I I I I I It 
Db 548 T TT GT GT GGCAGT GT GGAAGAAC CAC CACAT GAGGACGGT AAC CAACTACT T CAT AGT C A 607 

Qy 2 60 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I I I I I I Ml I I I I I I I I I I II M I I II I II I II I I II II 

Db 608 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 667 

Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 37 9 

I I II II I II I Mill I M II II II I I II II I II II II I I II II I II 
Db 668 AT AT CACT GAGAC CT GGTT T TTTGGACAGT C C CT T T GCAAAGT GAT T CCT T AT CT ACAGA 727 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I II II I Mill III I II II M II MM .III II MM M II II I I I 
Db 728 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 787 



Qy 44 0 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 4 99 

I I I II I I I I I I I I II I II I I II II I I I I I I II I I II I I III I 



Db 



788 CAAT CT GT C ACC CT TT GAT GT T TAAGAGCACAGCAAAGC GGGC C CGT AAC AGCAT T GT C A 847 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



500 



848 



560 



908 



620 



968 



680 



GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCGCCAGGCTGCAGTCATGCAATCCAGCA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCATCTGGATTGTCTCCTGCATTATAATGATTCCTCAGGCCATCGTCATGGAGTGCAGCA 

GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 

I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I 

CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 



559 



907 



619 



967 



679 



CAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCC 

I I I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I I I I I I I I 

GT GGT GAAAT T TAT C CCAAGAT GT AC CACAT CT GT TTCTTTCTGGT GACAT ACAT G GCAC 1027 



739 



CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 
I I I I I I I I I I I II I I I I III I M I I I I I I I I I I I I I I I I I II I I I I 
1028 CACTGTGTCTCATGGTGTTGGCTTATCTGCAAATATTTCGCAAACTCTGGTGTCGACAGA 1087 



740 



799 



TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 
I i I I II It I I I I I I I I I I I I I I I I I I I I I I I I 

108 8 TCCCT GGAACATCAT CT GTAGTTCAGAGAAAATGGAAGCCCC TGCAGCCTGTTT 1141 



800 



859 



ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 
II I III I I I I I I I I I I I I I I I I I I I I I 

1142 CACAGC CT C GAGGG CCAGGACAGCCAACGAAGT C C C GGAT GAGC GCT GT GG C GGCT GAAA 1201 



860 



919 



TGAAGCAGAT GCGT GCAC GGAGGAAGACAGCCAAGAT GCT GAT GGTGGTGCT GCTGGTCT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1202 TAAAGCAGAT CCGAGCCAGAAGGAAAACAGCCCGGATGTT GAT GGTT GTGCTTTTGGTAT 1261 



920 



979 



TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 
III I I I I I I II II II III I I I I I I I I I I II I I I I I III I I I I I I I 
1262 TT GCAATT T GCTAT CTAC CAAT TAG C AT CCT CAAT GT GCTAAAGAGAGTAT T T GGGAT GT 1321 

980 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 

I III I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I 

1322 TTGCCCATACTGAAGACAGAGAGACTGTGTATGCCTGGTTTACCTTTTCACACTGGCTTG 1381 

104 0 T GTACGCCAACAGCGCTGCCAACCCCATCAT CT ACAACTTCCT CAGT GG 1088 

I II I I I I I II I I I I I II II II II II II II I I I I I I I I 
1382 TATATGCCAATAGT GCTGCGAAT CCAATTATTTATAATTTTCT CAGT GG 1430 



RESULT 15 
US-09-826-509-550 

Sequence 550, Application US/09826509 ... -'r- 

Publication No. US20030204073A1 
GENERAL INFORMATION: 
APPLICANT: Lehmann-Bruinsma, Karin 
APPLICANT: Liaw, Chen W. 
APPLICANT: Lin, I-Lin 

TITLE OF INVENTION: No. US20030204073A1-Endogenous , Constitutively Activated 
Known G 

TITLE OF INVENTION: Protein-Coupled Receptors 
FILE REFERENCE: AREN-207 

CURRENT APPLICATION NUMBER: US/09/826, 509 
CURRENT FILING DATE: 2001-04-05 



; PRIOR APPLICATION NUMBER: 60/195,747 

; PRIOR FILING DATE: 2000-04-07 

; PRIOR APPLICATION NUMBER: 09/170,496 

; PRIOR FILING DATE: 1998-10-13 

; NUMBER OF SEQ ID NOS : 589 

; SOFTWARE: Patentln Version 2.1 

; SEQ 1 ID NO 550 

LENGTH: 1335 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-826-509-550 

Query Match 45.7%; Score 507; DB 11; Length 1335; 

Best Local Similarity 69.7%; Pred. No. l.le-132; 

Matches 703; Conservative 0; Mismatches 300; Indels 6; Gaps 1; 



Qy 80 AT GAAGAT GAGT TT CT C CGCT AT CT GT GGC GT GATT AT CT GT AC C CAAAACAGT AT GAGT 139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I' I I I I I I I I I I I I I I I 
Db 104 AC GAC GAGGAAT T C CT GC GGT AC CT GT GGAGGGAAT ACCTGCACC C GAAAGAAT AT GAGT 163 

Qy 140 GGGTCCTCATCGCAGCCTATGTGGCTGTGTTCGTCGTGGCCCTGGTGGGCAACACGCTGG 199 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I II I I I I I I I 

Db 164 GGGTCCTGATCGCCGGGTACATCATCGTGTTCGTCGTGGCTCTCATTGGGAACGTCCTGG 223 

Qy 200 TCTGCCTGGCCGTGTGGCGGAACCACCACATGAGGACAGTCACCAACTACTTCATTGTCA 259 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 224 TTT GT GT GGC AGT GT G GAAGAACCAC C ACAT GAGGAC GGTAAC CAACT ACT T C AT AGT C A 283 

Qy 260 ACCTGTCCCTGGCTGACGTTCTGGTGACTGCTATCTGCCTGCCGGCCAGCCTGCTGGTGG 319 

I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I 

Db . 284 ATCTTTCTCTGGCTGATGTGCTCGTGACCATCACCTGCCTTCCAGCCACACTGGTCGTGG 343 

Qy 320 ACATCACTGAGTCCTGGCTGTTCGGCCATGCCCTCTGCAAGGTCATCCCCTATCTACAGG 379 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 344 AT AT CACT GAGAC CT GGT T T TTTGGAC AGT CC CT T T GCAAAGT GATT C CT TAT CT ACAGA 403 

Qy 380 CTGTGTCCGTGTCAGTGGCAGTGCTAACTCTCAGCTTCATCCCCCTGGACCGCTGGTATG 439 

I I I I I I I I I I I III I II II II II I I II III II I II I II I I I I I I I 
Db 4 04 CCGTGTCGGTGTCTGTGTCTGTCCTCACACTGAGCTGTATCGCCTTGGATCGGTGGTATG 463 

Qy 440 CCATCTGCCACCCACTATTGTTCAAGAGCACAGCCCGGCGGGCCCGTGGCTCCATCCTGG 499 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I III I 
Db 464 CAAT CT GT CAC C CT T T GAT GTT TAAGAGCACAGCAAAGC GGGCCC GT AACAGCAT T GT C A 523 

Qy 500 GCATCTGGGCTGTGTCGCTGGCCATCATGGTGCCCCAGGCTGCAGTCATGCAATCCAGCA 559 

••"in 1 1 1 1 iii ii ii iii i ii Mm 1 1 1 1 1 1. \-\ him 

Db 524 T CATCT GGAT T GT CT C CT GCAT TATAAT GATT C CT CAGGC CAT CGTCAT GGAGT GCAGCA 583 

Qy 560 GTGTGCTGCCTGAGCTAGCCAACCGCACACGGCTCTTCTCACTCTGTCATGAACGCTGGG 619 

III I M I II II I I I I I I I M I I I I I I I I I I I I 

Db 584 CCGTGTTCCCAGGCTTAGCCAATAAAACCACCCTCTTTACGGTGTGTGATGAGCGCTGGG 643 

Qy 620 CAGATGACCTCTATCCCAAGATCTACCACAGTTGCTTCTTTATTGTCACCTACCTGGCCC 679 

I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I II I I I I I 

Db 644 GT GGT GAAAT T TAT C C CAAGAT GTACCACAT CTGTTTCTTTCTGGT GACAT ACAT GGC AC 703 

Qy 680 CACTGGGCCTCATGGCCATGGCCTATTTCCAGATATTCCGCAAGCTCTGGGGCCGCCAGA 739 



1 1 1 1 1 I 1 1 1 1 1 1 1 II 1 1 III I II II 1 1 1 I II 1 1 1 1 1 1 1 1 I II 1 1 1 1 

Db 704 CACT GT GT CT CAT GGTGTTGGCTTATCT GCAAATATTTCGCAAACT CTGGT GT CGACAGA 763 



Qy 74 0 TCCCCGGCACCACCTCAGCACTGGTGCGGAACTGGAAGCGCCCCTCAGACCAGCTGGGGG 799 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 764 T C CCT GGAAC AT CAT CT GT AGTT CAGAGAAAAT GGAAGC C C C TGCAGCCTGTTT 817 

Qy 8 00 ACCTGGAGCAGGGCCTGAGTGGAGAGCCCCAGCCCCGGGGCCGCGCCTTCCTGGCTGAAG 859 

II I III I I I I I I I I I I I I I I I I I I I I I 

Db 818 CACAGCCTCGAGGGCCAGGACAGCCAACGAAGTCCCGGATGAGCGCTGTGGCGGCTGAAA 877 

Qy 860 T GAAGCAGAT GC GT G C AC GGAGGAAGACAGCC AAGAT GCT GAT G GT GGT GCT GCT GGT CT 919 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 878 TAAAGC AGAT CCGAGC C AGAAG GAAAACAAAAC GGAT GTT GAT GGTT GTGCT T TT GGT AT 937 

Qy 92 0 TCGCCCTCTGCTACCTCCCCATCAGCGTCCTCAATGTCCTTAAGAGGGTGTTCGGGATGT 97 9 

I II I I I I I I II II II III I I I I I I I I I I II I I I I I II II I I I I II I 
Db 938 TTGCAATTT GCTATCTAC CAATTAGCATCCTCAAT GT GCTAAAGAGAGTATTTGGGATGT 997 

Qy 98 0 TCCGCCAAGCCAGTGACCGCGAAGCTGTCTACGCCTGCTTCACCTTCTCCCACTGGCTGG 1039 

I III I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I 

Db 998 TTGCCCATACT GAAGACAGAGAGACT GT GTATGCCT GGT-TTACCTTTT CACACT GGCTTG 1057 

Qy 1040 T GT ACGCCAACAGC GCT GC CAAC C C CAT CAT CT ACAACTTCCT CAGT GG 1088 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1058 TAT AT GC CAAT AGT GCT GC GAAT C CAAT T ATTT AT AAT T TT CT CAGT GG 1106 



Search completed: October 16, 2004, 03:40:42 
Job time : 575.496 sees 



